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Section I 


INTRODUCTION AND SUMMARY OF RESULTS 

(i) Introduction 

This report covers one aspect of our total effort to understand the 
structure of the errors on the Ground Communications Facility (GCF) and provide 
error control (both forward error correction and feedback retransmission) on it 
for improved communication. Here we are concerned mainly with constructing a 
theoretical model of errors and obtaining from it all the relevant statistics for 
error control. Thus no specific coding strategy is analyzed in this report, 
although references are made in appropriate places to the significance to error 
correction of the distributions of certain error patterns as predicted by the 
model. The success of our continuing efforts in designing specific error correc- 
tion schemes on the basis of this GCF model will be reported elsewhere. 

Our model is based on the 4800 bps high-speed GCF dataline test run pro- 
vided by J. P. McClure [l] in March 1973, although we show that the same basic model 
is good for the 50 kbps wide-band data we analyzed earlier in [2], Indeed all the 
error statistics that are calculated for the high-speed dataline are also obtained 
for the wide-band dataline. As shown in Table 1, the high-speed data set consists of 
31 test runs on all the NASA lines between JPL and each of the outpost stations at 
Goldstone, Florida, Madrid (Spain), South Africa and Australia. McClure [1] has a 
detailed account of how the data were collected. There are two of the 31 test runs 
in which no errors are recorded (Madrid-JPL, duration 102 minutes; Go Ids tone -JPL, 
duration 146 minutes), but this perfect transmission is due to the line condition at 
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Table 1. Source of U.8 kbps data (adapted from McClure [1]) 


CTA 21 JPL 

DSS 1^ Gold stone 

DSS h2 Australia 


DSS 51 South Africa 

DSS 6l Madrid 

DSS 71 Florida 




Starting Time 

Duration 

Bit Error 

P(l|l) 


Origin 

Destination 

Day 

Hour 

MiniSec 

Rate, xl0"5 

Group 

k2 

21 

298 

02 

214:31 

1.30 

0.340 

/jnber 

21 

42 

298 

OP 

214:30 

4.50 

0.351 

' Amber 

71 

PI 

298 

23 

178:51 

1.25 

0.401 

Amber 

6l 

21 

299 

04 

102:06 

0 

0 

Green 

21 

61 

299 

04 

107:49 

2.70 

0.366 

Amber 

h2 

21 

299 

23 

192:42 

0.23 

0.285 

Green 

21 

42 

299 

23 

192:14 

0.36 

0.300 

Green 

ik 

21 

300 

03 

146:26 

0 

0 

Green 

21 

l4 

300 

03 

146:50 

0.18 

0 

Green 

71 

21 

300 

23 

52:22 

25.4 

0.410 

Red 

71 

21 

301 

00 

68:05 

51.51 

0.315 

Red 

71 

21 

301 

01 

24:36 

2.36 

0.186 

Amber 

6l 

21 

301 

02 

222:16 

15.1 

0.388 

Red 

21 

6l 

301 

02 

222:29 

1.16 

0.4i6 

Amber 

Up 

21 

326 

21 

118:11 

0.66 

0.399 

Green 

21 

42 

326 

21 

118:10 

1.13 

0.342 

Amber 

42 

21 

326 

23 

44 : 58 

0.02 

0 

Green 

21 

42 

326 

23 

44:57 

1.00 

0.015 

Amber 

51 

21 

333 

17 

191:02 

3.10 

0.325 

Amber 

6l 

21 

335 

16 

75:43 

0.98 

0.420 

Green 

21 

6l 

335 

16 

75:27 

2.56 

0.374 

Amber 

6i 

' 21 

335 

18 

99:51 

2.30 

0.324 

Amber 

21 

61 

335 

18 

99:51 

2.48 

0.154 

Amber 

42 

21 

335 

21 

170:12 

4.02 

0.271 

Amber 

21 

! 42 

335 

21 

170:15 

6.49 

0.323 

Amber 

61 

i 

340 

16 

[ 76:00 

0.51 

0.277 

Green 

21 

6l 

340 

16 

75: 

3.97 

0.392 

Amber 

6l 

21 

34o 

18 

100:36 

1.63 

0.405 

Amber 

21 

6i 

340 

18 

100: 

1.79 

0.353 

Amber 

51 

21 

340 

20 

i 168:00 

5.30 

0.385 

Amber 

21 

51 

340 

1 

1 

20 

151:58 

3986.0 min. 
^ 66. U hrs. 

6.51 

0.368 

Amber 
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that time of day rather than a permanent feature of the lines between those sta- 
tions. For example, the same Madrid-JPL link on another day (day 301) has one 

of the worst error rates recorded in the 31 runs. We follow McClure in dividing 

the different bit error rates obtained in the tests into Green, Amber and Red 
groups: the Green group consists of those with bit rates of less than 1 x 10 ^ , 

those in the Amber group have bit rates between 1 x 10"^ and 1 x 10"^, and in 

the Red group are those with a bit error rate higher than 1 x 10"^. Of the 31 
test runs, only 3 are in the Red group, 7 in the Green and 19 in the Amber group. 
(We discount the two error-free runs in the analysis^) 

A rate-one code built into the GCF modems causes a fixed pattern of errors 
after each random error on the channel. In the 4800 bps high-speed data these 
fixed errors occur at bit positions 18 and 23 away from each random error. The 
positions are 3 and 20 in the 50 kbps wide-band data. It is now being determined 
whether to remove this fixed error-causing code or process the received data to 
remove the errors after each transmission. 

It is not the high bit error rate, however, that makes this type of channel 
difficult to model. Rather it is the fact that the errors, when they do occur, 
tend to cluster together. In other words, the channels display some memory. How 
long or short a memory one should build into the model depends on the particular 
channel and the ease of handling the analysis of a model with a realistically long 
memory. On the GCF, the chance that a bit error will be followed by another bit 
error, denoted by p(ljl) in Table 1, ranges from a high of 42% to a low of less 
than 2%., depending on the data-line condition and bit error rate. For example, 
in the Green group, a long error-free transmission followed by a burst of errors 
lasting only one second may have a high probability of consecutive bit errors 
while another test run with burst of errors scattered through the whole duration 
may result in high bit error rate and low probability of consecutive errors. 
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Before we summarize the results of the report, let us fix our ideas of a 
burst. As definition we adopt an intuitive notion of determining a burst from a 
sequence of transmissions on the channel as a sequence of bits beginning and end- 
ing with an error, separated from the nearest preceding and following error by a 
gap of no less than a given length, say G, called the guardspace and containing 
within it no gap of length equal to or greater than G bits* From this definition, 
it is clear that the length (in bits) of a burst depends on the guardspace G; the 
longer the guardspace, the longer the burst length, some of the bursts at shorter 
guardspace being combined into a single burst when the guardspace becomes longer. 

For example, for G = 400, the first test run contains 322 bursts, the longest of 
which is 6133 bits containing 141 errors* The same run for G = 3600 has only 
100 bursts; the length of the longest burst is now 217362 bits containing 3550 
errors. This is typical of the GCF data line; in the error mode, there are still 
some good runs several hundred bits long but not long enough to allow more than 
a few 1200-bit blocks to pass through error-free* 

The histogram for the thirty-one runs of the high-speed 4*8 kbps data is 
shown in Figure 1* The error-free gap lengths are represented on the X-axis and 
their frequencies in the 31 runs, that is the number of times a gap of length X 
appears, on the ordinate* For example, the number of consecutive errors (at X = 1) 
is 17, 149, while the number of times gaps of length 100 ^ X ^ 499 appear is 652* 

(ii) Summary of Results 

There are two broad classes of theoretical models that have been proposed for 
burst noise channels: the Independent Gap Model (or the Pareto Model) , which assumes 

that successive gaps are approximately independent and suggests the Pareto distri- 
bution for the gaps, and the Markov model, which combines Markovian property with 
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Independent Gap property. The Markov model assumes that given that an error has 
occurred on the channel, the length of the gap following the error is independent 
of the length of the gap prior to that error bit. In general, however, when errors 
may occur in more than one state of the channel, the Markov model does not assume 
the Independent Gap property underlying the Pareto model. By looking at the graphs 
of certain functions of the empirical gap distributions, we showed that the Pareto 
model cannot be employed to model the errors on the GCF, Indeed, the Pareto model 
performs well only on good quality telephone channels (only in the Green error 
mode). Since our concern here is with the Red and Amber error modes, we have re- 
stricted our choice of a model to the Markov class and succeeded in getting a five- 
state model, diagrammed in Figure 3, that gives an acceptably good fit. 

The five-state model we used has only one error state B which connects to 
perfectly good states Gx, G 2 > G 3 , and G 4 . Errors occur in state B with probability 
one each time the process enters this state, consecutive errors occurring with the 
indicated probability 0 < q < 1. The further away a good state is from B, the longer 
the sojourn time of the process in that state. Long gaps indicate the process is 
in the best state G^^ and the short bursts of errors indicate transitions between 
the error state B and G^. 

A single state B in which errors occur with probability one is not really 
acceptable, as close scrutiny would reveal, for it is well-known that the error- 
causing mechanism on the channel does not reverse the bit each time an error occurs, 
a fact which we seem to ignore in our model. We hasten to point out, however, that 
a model using a state ^ in which errors occur with some probability 0 < h < 1 in- 
stead of B can be made to be mathematically equivalent to our model by appropriately 
increasing the number of good states and adjusting the corresponding transition 
probabilities. Moreover, introducing such a state B would involve unnecessary com- 
plications in the analysis. 
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A general method of getting maximum likelihood estimates (MLE) of the 
model parameters from the raw estimates obtained from data is presented. This 
method is applicable to any finite-state Markov process and hence to any Markov 
model • 

We consider the gap distribution a basic property of the channel because, 
in our model, the process renews itself each time it enters the error state. In 
other words, the occurrence of an error is the renewal event which wipes out the 
memory of the past gap* That is why we judge the performance of our model by how 
good a fit it gives to a function of the gap distribution as calculated from the 
data. Sample graphs of typical fits in each of the three error modes are shown 
in Figures 4, 5 and 6. Goodness-of-fit tests are performed for each of the 
error runs, each of the Green, Amber and Red error mode channels, and for a 

single channel obtained by combining all the error runs and treating each as an 

independent sample from some basic common distribution. Since it is more impor- 
tant, for purposes of error control, to have very accurate predictions of error 
clusters when the gaps are short (high bit error rate) than during long inter- 
vals of error-free transmission, we concern ourselves with just how good a fit 
we obtain for gaps of 4000 bits or less* The results are very good indeed for 
individual test runs* The Kolmogorov-Smirnov test predicts that in 99% of the 
time the error of our prediction (the absolute difference between the model and 
empirical values) should not be more than 3*6%* In the Red group, the maximum 

error of our prediction is 2*3% (see Tables 4a and b) , But the better the channel 

(the lower the error rate), the less spectacular this agreement becomes* For 
example, two of the 19 test runs in the Amber group fail this test only slightly 
while the fits obtained in the two cases that fail the test in the Green group are 
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less than satisfactory; in this case, the bit error rates are 2 x 10"^ and 9.8 x 10”^, 
and the percentage prediction errors are 51.4 and 4.5, respectively. We then ask that 
the errors not be more than 2.5%. The statistical test in this case says that about 
85% of our test runs should have less than this percentage error. In the Red group, 
the highest prediction error of 2.3% falls below the theoretical bound of allowable 
deviation, while about 58% of the Amber group pass the test. The important thing 
is that those test runs with high error rates all give acceptably good fit with 
prediction errors of less than 2.5%. 

The grouped channels (Red and Amber) give less excellent agreement with 
individual data runs. Errors of up to 8% are recorded in the Red group and 14.5% 
in the Amber, This fact is in great part due to the wide range of error rates 
recorded in each mode: 15 x 10"^ - 51 x 10’^ in the Red and equally wide varia- 

tion in the Amber. But the greatest revealing fact was obtained when we attempt 
to fit a single channel to every one of the 29 error runs. The error is about 10% 
in the Red, between 2% and 68% in the Amber, and up to 77% in the Green group. It 
is therefore clear that the errors on the GCF do not follow a single distribution. 

In other words, the channel performance is significantly different for varying line 
conditions. It is now understood that this is caused by the varying load on the GCF. 
When users come onto or drop off the channel, the characteristic, of the channel 
changes. A realistic model should incorporate the times between these changes and 
the characteristics of the channel when the changes occur. A way of constructing 
such a model is detailed. This and all the results mentioned above are presented 
in Section II. 

Section III is devoted to the autocorrelation of bit errors , This is the 
probability of having an error k bits away following a given initial error, ks 0. From 
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it we not only gain knowledge of significant error patterns but we also deduce the 
memory of the channel in the different error modes. For example, in both the Red 
and Amber groups the memory is very much longer than 1200 bits. It is only in 
the Green group that the memory is just about 1200 bits (see Tables 6 and 7) . 

In Section IV, we deal with the capacity of the channel (the maximal rate 
for which reliable transmission over the channel is possible). Since the capacity 
of a burst-noise channel is always larger than that of a binary symmetric channel 
(BSC) with the same bit error rate, which, at the error rate on the GCF, is large 
enough (>0.996 for the high-speed circuit and > 0.997 for the wide-band) , it is 

clear that for purposes of error control the capacity does not present any problem. 

The irony is that forward error-correction is more difficult than for the corres- 
ponding BSC . 

One group of statistics that turns out to be very important in estimating 
the performance of block codes is the block-bit statistics. A block is defined 
as a sequence of n bits for a fixed integer n. This group includes: 

(a) the block error rate as a function of block sizes; 

(b) distribution of the number of errors in a block; 

(c) distribution of distances between extreme errors in a block; 

and 

(d) distribution of errors in a code interleaved to some depth t. 

All these statistics are presented in Section V. 

Discounting the outages (those times when the error rate was so high that 
transmission was stopped) which McClure speaks about in [[ll, the block error rate for 
a 1200-bit NASA-standard block length in the 4800 bps data ranges from a low of .021 
of \7o during the Green error mode to as high as 1.87. in the Red group. An error 
block is defined as a block having one or more bit errors. Tables 10 & 11 show the 
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predicted and empirical block error probability for the HF 4.8 kbps and wide-band 
50 kbps circuits. In both cases the predicted values are very close indeed to the 
data values. There is also close agreement between predicted values of block 
thruput and the empirical values calculated by McClure in [l]. The block thruput 
refers to those blocks received error-^free. For the Green group the empirical 
value is 99^967d, the model value is 99,963%. For the Amber the values are 99,637o 
and 99.797o, the Red group gives 97,717o and 99.347c respectively. The total average 
empirical block thruput of 99.557. is close to the predicted value of 99.787o (see 
Tables 14 and 15). 

But it is the density of errors in the error blocks that is more important. 
For if an error block contains only one error it is an easy matter to locate and 
rectify that error with only a few changes in the present specifications on the GCF. 
Even if there are more errors but they are all confined within a given length in 
the block it is still possible to find a burst-trapping code that will correct all 
of them. This is why we have calculated not only the distribution of errors in a 
block and the proportion of the blocks with more than a given number of errors but 
also the distances between the first and last errors in an error block. The pro- 
portion of error blocks containing twenty-five or more errors is less than 25% in 
most of the runs at 48 bps. Runs in which this proportion is more than 50% are 
really badly hit for in them the proportion containing fifty errors or more is 
equally high, so that the 3-bit maximum error correction capability which can be 
achieved on the GCF even if all the 33 bits currently allowed for error detection 
and correction in each 1200-bit block were used for error correction alone would 
still fall short of correcting a large proportion of the error-blocks (Table 13a 
and b) . Graphs of the distribution of errors in a block are provided in Figures 8, 
9, and 10. 
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The effect of the fixed error pattern introduced by the rate one code 
built into the modem becomes apparent in the distribution of distances between 
extreme errors (or of length of burst of errors) in a block mentioned earlier. 

If we consider only those blocks with two or more errors, the proportion having 
their burst confined to within exactly 23 bits is as much as 24% in the Red group; 
in the Amber group there is a run with as high as 83% while a percentage of 98 
is recorded in the Green group (Table 5). This explains why the empirical and 
model values in this case are not as close as one has expected (Table 16) and 
further impels us to remove this error-causing modem code so as to be able 
realistically to assess the performance of the different error correcting schemes 
that are now being considered for the GCF, 

A rather effective way to correct burst noise is to interleave the coded 
blocks to some depth t, say. Here the bits of each coded block are not trans- 
mitted consecutively but are interspersed in such a way that they are transmitted 
exactly t bit positions apart. For sufficiently large t, at the receiver, the 
blocks appear to have been corrupted by random noise thus spreading out the 
errpr clusters over many blocks. The trick is to thin out the errors in each 
block to a sufficiently low number that a known error correcting code (e.g., a 
BCH code) of high enough capability can be used to correct the resultant errors. 
Taking t = 6 and interleaving each block so that the length of each block trans- 
mitted separately is 200 bits we found that the proportion of blocks containing 
few errors has increased thus decreasing the proportion with a large number of 
errors. For example, in the Red group the proportion containing exactly one 
error increased to 0.06 of 1% with only 0.0045 having four or more errors; these 
proportions are 0.05 of 1%, and 0.0006 respectively for the Amber group and propor- 
tionately higher numbers for the Green group. The encouraging fact is that in each 
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case the proportion of blocks with three errors or less is at least twice as 
high as it is without interleaving. One therefore can expect better results 
(higher proportion having fewer errors) when the depth of interleaving t is 
increased thereby enabling us to correct a sufficiently high proportion of the 
errors without significantly changing the present GCF standard specifications. 

As opposed to the distribution of bit errors in a block found above, in 
Section VI we concern ourselves with Block (symbol) Error distribution. In this 
case a block is considered as made up of symbols, each symbol being a fixed num- 
ber of bits. Because of the burst noise it may be more efficient to employ an 
algorithm designed to correct up to a given number of symbol errors in a block 
rather than one that can correct only bit errors. This is especially so in cases 
where, although the number of bit errors in the block is higher than the error- 
correcting capability of the code employed, the errors are all confined to within 
only a few symbols. 

Let us mention particularly the distribution of error symbols in n-symbol 
word and the autocorrelation of error symbols. For the standard 1200-bit block, 
if a symbol length of 6 bits is used, then the proportion in the Red group of the 
200-symbol blocks that have 3 symbol errors or more is only 0.0061, Thus a code 
having two symbol error-correcting capability will fail to correct in only 0.61 
of 17a of the time. To achieve this efficiency an error-correcting code must be 
able to correct up to 5 bit errors in the 1200-bit block. The proportions for 
symbol lengths s = 8 and 10 and for all the different error groups are shown 
in Table 21, Table 22 contains the autocorrelation of symbol errors for 
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symbol lengths s = 6 and 10 bits. For s = 6 bits, the highest correlation 
between an initial error-symbol and another 200 symbols away is 0,08, the least 
is 0.0028 in all cases in the 4800 bps circuit where the symbol error memory is 
longer than 200, This means that unless combined with forward error-correction, 
a feedback retransmission scheme may be impractical on the GCF for error correc- 
tion because occurrence of an error block (symbol) may cause a high number of 
others to occur in quick succession thus causing a problem of buffer over-flow. 

As an immediate application of the block-bit and symbol error distribu- 
tions we find the sync acquisition and maintenance probabilities. The two strate- 
gies we compare are both based on using a prefix sequence of 24 bits in each of 
the 1200-bit blocks. These strategies are: 

1, to accept sync if there are not more than 3-bit errors in the pre- 
fix sequence 

2, to accept sync if there is at most (only) one error symbol in the 
24-bit prefix considered as four 6-bit symbols. 

Our criterion of comparison is the efficiency of each of the algorithms in 
reacquiring a lost synchronization within a frame of 1200 bits after it is lost and 
of maintaining it once it is reacquired. It is found that the first algorithm 
will lock onto the wrong synchronization in over 167 o of the time although it will 
hardly fail to identify the true sync sequence. On the other hand, the second 
scheme will lock onto the wrong sync in less than 2% of the time and it is equally 
as efficient as the first in not failing to identify the true sync sequence 
(Tables 23a and b, 24a and b) . 

This conclusion is not surprising since the second algorithm takes advan- 
tage of the burst noise by allowing up to 5 errors provided they all occur within 
a single 6-blt symbol. 
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To answer the question as to when the first algorithm is efficient, we 
increase the prefix sequence to 30 bits and find that allowing up to 3 bit-errors 
will falsely detect a sync in less than 0.5% of the time. In this case the 
second algorithm provides ample protection against both types of errors , 

Lastly, in Section VII we take up the important question of burst dis- 
tribution. The fact that no good error correcting device can be constructed 
without the knowledge of this distribution attests to its importance. 

To understand the nature of the bursts we find out how long they are, how 
dense the errors within them are and particularly how many 1200-bit blocks are 
affected each time the channel enters into a burst mode. Specifically, we calcu- 
late the 


(a) distribution and mean of burst lengths; 

(b) the distribution of the density of errors in a burst of given length 
and the mean number of errors in such a burst; 


(c) the block-burst distribution. 


The last distribution is intended to give us an idea of the number of blocks that 
are likely to be affected each time a burst of error occurs. 

But before we answer the above questions we review two criteria of choos- 
ing an optimum guardspace G since all the distributions depend on G. We agree to 
call a G optimal for a code C (with desirably high rate R) if a high proportion 
of the bursts (with respect to G) is less than the burst correcting capability 
of C. The burst correcting capability of a code relative to G is the largest 
integer, b, for which every noise sequence containing only bursts of length b 
or less is correctly decoded. It is shown that a guardspace of 400 bits hitherto 
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being used in [1] is not adequate. However a guardspace of 3600 bits seems to 
work for both the 4800 bps high-speed and the 50 kbps wideband data. See Tables 
27a and b for the bursts using different values of G. 

In the 4800 bps circuit the mean burst length varies, from 41 bits in the 
Green error mode to 340 bits in the Red with an average of 135 bits overall. The 
high standard deviation of burst lengths (Table 28) is explained by the wide 

_3 

variation in the bit error rate (0-10 ), 

The error density in a burst is obtained for guardspace G = 400 and 3600 
bits (Table 29), This density can be as high as 6% in the highest error mode; 
in the 50 kbps circuit it can be up to 8% when G = 3600, As expected the mean 
number of errors (and the ratio of bad/good bits) in a burst decreases with in- 
creasing guardspace. 

Using the standard 1200-bit block and a block guardspace of 10 blocks 
there is as high as 57. probability of getting a block-burst extending to 10 
blocks or more (when the channel is in the Red error mode) . 

Our opinion of this work is contained in Section VIII which also lists 
a few problems indicating the line future investigations should follow. 
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Section II 


CHANNEL MODEL AND PARAMETER ESTIMATES 
(i) Criterion for Choosing a Model 

The histogram for the thirty-one runs of the U.8 kbps high-speed data is 
shown in Figure 1 below. The error- free (gap) lengths are represented on the 
X-axis and their frequencies in the 31 runs, that is the number of times a gap 
of length X appears, on the ordinate. For example, the ntunber of consecutive 
errors (at X = 1) is 17,1^9 while the number of times gaps of lengths 
100 i X « appears is . Actually a gap length as shown on the histogram 

includes the position of the error bit that ends the gap. Thus to get the 
number of gaps of length 500, say, we should read the ordinate at the point 
X « 501. 



Figure 1. Histogram for the 4.8 kbps high-speed data. 
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The common feature of the itSOO bps high-speed and the 50 kbps wideband 
data is the way the errors tend to cluster together. Long gaps of error- free 
transmissions are followed, by up to four seconds (or more) of sputtering errors. 
Within these "bursts" of errors there are intervals of good data. The problem is 
to construct a model for such a channel from which to derive statistics for error 
correction (both forward error correction and feedback retransmission). 

Two broad classes of models have been proposed for these channels depend- 
ing on what have been considered their main features. These are the Independent 
Gap or Pareto model and the Markov model. The Pareto model, so called because it 
assumes that successive gaps are approximately independent, was espoused by Berger 
and Mandelbrot in [3]. Pareto distribution was suggested for fitting the gaps. 
Earlier on, a Finite State Markov Chain had been suggested by Gilbert [4*] for 
fitting the error sequence on such channels. The reason was that the gaps in 
the data seem to combine the Markovian property with the independent gap property. 
This class of models is called the Markov model. In the general case when the 
error clusters are different for different phases of the channel and hence more 
than one error state is used in the Markov model no assumption of independent gap 
distribution is made (see the generalization of Gilbert's model by Berkovits, 
Cohen, and Zierler in [5]). Up to now the choice of which model to use has not 
been based on an explicit criterion. 

We shall briefly review both models and give a criterion for deciding which 
applies to a given set of data. 

a. Independent Gap (Pareto) Models 

Let (z^} be the error sequence, i.e., z^ = 1 if the bit is in error 
and 0 otherwise. Let 


16 


JPL Technical Memorandum 33*-699 



V(k) = P(o’^l]l); 


k ^ 0 


( 1 ) 


be the gap distribution where k is the gap length. Let 


U(t) S V(k) . 
k^t 


Thus 


U(t) = P{gap of length k :» t) . 


Berger and Mandelbrot in TS] provided evidence, from the data they used, that 
successive gaps are approximately independent and suggested using Pareto 
distribution t”*^ for the gap distribution. That is. they put 

U(t) t”®^ ; 0 < O' < 1 (2) 


or 


P(gap of length k < t) = 1 - t 


with probability density function For this range of or the 

moment cy S does not exist for any finite n. So t is restricted to 

some interval 0<6 st ^ h < m and (2) is reduced to a three parameter model 


fx 


U(k) = \ 


{k/6)-"^ 


0 


k < 6 
6 5 k 5 L 
L < k 


(3) 
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It Is convenient to use 


0 


log U(k) = i O’ log 6 - Of log k 


k < 6 
6 ^ k < L 
L < k 




From (4) it is not difficult to calculate the average bit error rate, 
the block error rate, the ratio of error probabilities for two different 
block lengths, etc. For example; 


P 





(5) 


It is also a straightforward matter to estimate the parameters of this model. 

The best fitting straight line for k between 6 and L has slope of. The 
intercept at probability one occurs for a gap length 6^1 and L can be 
estimated from relation (5), having been obtained from data. See Sussman 
[6] for further details . 

But it is the shape of the graph of (4) between 6 and L that we shall 
dwell on here. Within this interval log U(k) is always a straight line, so 
that any channel whose empirical U(k) cannot be fitted with a straight line on 

the log- log plot cannot be modelled by the Pareto distribution. For the GCF, 

Figures 4, 5 and 6 show that log U(k) is convex for k ^ 2000 and then 

becomes approximately a straight line. The interval of convexity of log U(k) 
depends on the error mode; for the Green (low bit error rate) group k s 150 and 
it increases as we enter the Red error group. Thus we reject the Pareto model 
for the GCF. 
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Markov Model 


b . 

Gilbert's original model shown in Figure 2 consists of two states G and 
B, The channel alternates between the 'good' state G and the 'bad' state B accord- 
ing to a set of transition probabilities as shown. Errors may occur only in 
state B with some probability 0 < h < 1. Transitions between states G and B 
plus the possibility of sojourn in either state (with probabilities Q and q, 
respectively) generate the bursts. Occurrence of an error implies the channel 
has returned to the state B and the process begins anew. Therefore successive 
gaps are independent and the model has both the Markov and independent gap 
property. 


P 



Figure 2 , Gilbert Model 

Gilbert showed , among other things , that 

U(k) = ML^ + k 2: 0 (6) 

where M, N, J, L all depend only on the nodel parameters and 0 < L « J <1, 

Hence log U(k) is NOT a straight line for small to moderately large values 
of k. For sufficiently large k, log U(k) behaves like cj^ + k log J, a straight 
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line, where is a constant. This more closely resembles the shape of 
log U(k) shown in Figures 4, 5 and 6. 

We shall therefore construct a Markov model for the GCF. The mant modes 
of the histogram (Figure 1) convince us that neither the Gilbert model nor its 
two-state generalization Zierler, et al [5] can provide adequate fit for our 
data. See also our earlier attempt in [2]« 

Let us now detail the procedure taken to construct a Markov model from the 
histogram. 

(ii) The Model and Its Variations 

We want to choose a ^^natural’* model suggested by the histogram in the 
following way. Let us make the assumption that whenever an error occurs, the 
behavior of the channel at the time is independent of how good the channel 
was prior to that time. In other words the behavior of the channel each time 
it enters the bursty state is statistically the same irrespective of how long 
the time has been since it last showed this burst phenomenon. (In this report 
as In Reference 2 we use the same definition of burst, i.e. as a sequence 
beginning and ending with an error, separated from the nearest predeeding and 
following error by a gap of no less than a given length, say G the guardspace 
and containing within it no gap of length equal to or greater than this guard- 
space.) Each time the channel enters a burst, that is each time we observe an 
error after a long gap, the length of the burst, the number of errors within it 
and the distribution of these errors are therefore all independent of what had 
gone on prior to the occurrence of this phenomenon. We can therefore represent 
distinct groups of gap lengths by distinct states of the channel and indicate 
the beginning of a burst by a return to a single error state from states repre- 
senting long enough gaps. The short gaps and consecutive errors within a burst 
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are then represented by transitions between this error state and those states 
representing appropriately short gaps. We shall make this concept more precise 
as we go along. 

With respect to this "natural" v;ay of constructing a model for the 
channel it would be necessary to represent each mode of the histogram by at 
least a state each of which connects to a single error state. We would there- 
fore represent gaps of the following range of lengths corresponding to the 
modes of the histogram by distinct states: 

X = 1 ; 2 « X s U9, 100 X ^ i^ 99 ; 1000 < x s 499; 

10,000 < X ^ ^ 9999 ; 100,000 « X ^ 499999 and X s 10 ^ ^ 

a total of seven states for the channel. But there are a number of objections 
to having so many states for the channel. These include the fact that a model 
with so many states may be unwieldy to analyze and even if we succeed in 
doing the analysis, such a model would be of very little practical use. A 
model should not be more complicated to understand than the phenomenon 
it is designed to explain! 

The five state model we found to give acceptably good fit is shown in 
Figure 3 below. State B is the error state which connects to the perfectly 
good states G^, G^ and G^. The good states represent gap lengths 

X s 10^, 1100 £ X ^ 99999 > 50 5 x e 1099; 2 X ^ 49 ( 7 ) 

respectively. These interval boundaries were determined from the histogram. 

All errors occur in state B, consecutive errors occuiTing with Indicated proba- 
bility 0 < q < 1. Short bursts represent transitions between states B and Gj^^. 
Varying gap lengths are represented by transitions between state B and G^, 

G^J the very long gaps indicating the process is in state 
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The model is not unlike four workmen with varying degrees of efficiency 
Pi > Pg ^ P 3 ^ Pk employed to maintain a system. We agree to call a workman 
and his efficiency rate by the same name. Each time the system breaks down (in 
state B) any one of the four workmen is called upon to do the repairs, workman 
p. being called with probability c.; j = 1, U. The probability is q that 
the maintenance supervisor will not call on any one of the workmen immediately 
the system breaks down. If he calls however, the length of time after the 
repairs are done for which the system remains in working condition is propor- 
tional to the workman’s efficiency. In other words if workman p. is called 
upon the chances are q. that the system will not stay in working condition 
the next unit of time. Thus the lower the workman’s efficiency the higher his 
j = 1, 4. (This analogy was suggested by E. C. Posner.) 

u 


22 


JPL Technical Memorandum 33-'699 



In terms of transitions, is the probability that the process stays 

in state G.; c* is the probability that the system moves from state B to state 

Gi in one step while q is the probability of returning from state G* to state 
J j J 

B in a single step, j = I 5 4; q is the probability of remaining in state B, We 
denote the one-step transition probability of going from state i to a state k 
by P(k|i). Thus 

- Pj 


P(GjjB) = Cj 

P(b|g.) = = 1 - Pj 

P(B|B) = q = 1 - Ecj 


j = Ij-i. 


The physical explanation given above is not really acceptable as a close 
scrutiny would reveal. For it is fairly well-known that the error-causing 
mechanism on the channel does not reverse the bit each time an error occurs, a 
fact which we seem to ignore in our model in which we allow errors to occur 
with probability one each time the process reaches the state B. We hasten to 
point out however that a model using a state B in which errors occur with some 
probability 0 < h < 1 instead of B can be made to be mathematically equivalent 
to our model by appropriately increasing the number of good states and adjusting 
the corresponding transition probabilities. Moreover, introducing such a state 
B would involve unnecessary complications in the analysis. 

Even then the five-state model rather over-simplif ies the actual channel. 
For instance we have assumed that it is possible to fit a channel with error 
rate varying between 0 and 10 "^ and exhibiting three distinct error modes by 
a single stationary model. If the model performs well at high error rates 
it cannot be expected to depict the channel in the Green error mode. For 


JPL Technical Memorandum 33^699 



when other users of the GCF come onto or drop off the line, the characteristics 
of the channel change significantly*, A realistic model must incorporate such 
changes . 

One such model can he obtained as a generalization of our five state model. 
Instead of the p., c., j = 1, 4, being a fixed set of parameters we use 

1, 4, m == (Red, Amber, Green)} . (8) 

That is, we use a separate set of parameters for each of the error modes, much as 
we have done in this study, but further incorporate the varying line conditions 
caused by users coming onto and dropping off the line. The number of users on the 
channel at any given time csui be modelled by the Poisson distribution, P(X), with 
some parameter X, and the times between changes in the line condition then follow 
the exponential distribution. This means the line condition changes according to 
the Poisson distribution, the times between these changes following the exponential 
distribution. When a change does occur it can be to only one of the error modes 
Red, Amber or Green. The parameter X of the Poisson distribution can be 
identified with the mean rate of user arrivals or the mean number of users per 
unit time. The estimation procedure for parameters in (8) is the same as is used 
in this study. 

Because of its simplicity we have decided to use the stationary five-state 
model however. As a fitting model for the highest to moderate error mode (Red 
and Amber) our results bear us out. 

Before we talk about how good a fit the model gives to the data let us 
take a look at the estimation procedure employed. 


We are grateful to L. R. Welch for discussions leading to this understanding. 
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(lii) Estimation of Parameters 


In this section the procedure for estimating the model parameters p. 

3 

and e., j = 1, from the data will be reviewed and the method for getting 
the Maximum Likelihood Estimates (MLE) indicated. For detailed analysis of the 
MLS method the reader is referred to Appendix I. 

Let £. = number of times the process enters state j, j = 1, U. 
kji = the length of gap i, i = 1, in state j, j - 1, U. 
kj = the threshold to state j or the minimal gap length determin- 
ing state j, j = 1, U. 

N = the number of errors in the run. 
e 

" cardinality of [x = l] or the number of occurrences of gaps 
of length zero in the run. 

Then it can be shown (see Appendix 1) using the method of maximum likelihood. 


that 




i. 


i==l 


k . . .k . 

01 <3 J 


j 

S k..-£.k.+£. 
Ji 0 0 0 




C . 

3 


= 1 “ P, 


N. 


0 = 1 , 


(9) 


q = 


N. 
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The easiest way to understand the above expressions is to consider 
as the proportion of time spent in state j as a fraction of the sum total of 
time spent in j and the number of times the process enters j; c. as the 
number of times it enters state j as a fraction of the total number of times 
it crosses state B. 

The above estimates we call raw estimates because they are obtained 
directly from the data using the gap length intervals stated in (7). These 
estimates are shown in Table 2(a). Table 2(b) contains the raw estimates for 
the 50 kbps wideband data for the same gap intervals used above for the k.Q kbps 
high frequency data. The reader is asked to refer to Reference [2] for a descrip 
tion of and histogram for the 50 kbps data. 

Let us now indicate how the optimum set of model parameters are obtained 
from these raw estimates. 

Denote the probability of getting k error- free bits between a given 
error and the one immediately following it (i.e., a gap of length k) by V(k); 

V(k) = P(0^l(l) 

where {0^1 |l} is the event that a given initial error is followed by a gap of 
length k. Then 

V(k) > P(o‘‘|l) . P(o‘'*l|l) 

= U(k) - U(k+1) 

I IC 1 

where we have denoted P(0 [l) by U(k) and {O jl} is the event that a given 
error is followed by at least k error- free bits. Also let us represent the 
sequence of noise digits by z = which = 1 ■ if the digit is in 

error and = 0 otherwise. 
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Table 2 (a). Raw estimates of P and C for 

the 4.8 kbps high frequency data 


1 

A 

P 

.9999987 

.9999130 

.7890661 

.9956507 


A 

c 

.20483731>-02 

.96115950-02 

.6137241 

.59245252-01 

2 

A 

P 

.9999994 

.9999866 

.7843750 

.0000000 


A 

c 

.1071429 

.89285710-02 

.6160714 

.0000000 

3 

A 

P 

.9999998 

.9999731 

.6190476 

.9979798 


A 

c 

.1265823B-01 

.63291150-02 

.5738397 

.42194090-02 

4 

A 

P 

.9999986 

.9999738 

.7050070 

.9942991 


C 

.2223089B-OI 

.i 404056 b -01 

.5721529 

.66302640-02 

5 

A 

P 

.9999935 

.9999675 

.7701746 

.9915816 


A 

c 

,/;04459E-01 

.28722160-01 

.5709261 

.35756162-01 

6 

A 

P 

.9999998 

.9999557 

.7037037 

.9978564 


A 

c 

.I866252E-OI 

.15552102-01 

.5598756 

.62208400-02 

7 

A 

P 

.9999993 

.9999381 

.7216154 

.9977547 


A 

c 

.13318930-01 

.19798420-01 

.6054716 

.10439162-01 

8 

A 

P 

.9999998 

.9996789 

.7849265 

.9982025 


A 

c 

.60000002-01 

.45000000-01 

.5850000 

.15000002-01 

9 

A 

P 

.9999998 

.0000000 

.9047619 

.0000000 


A 

c 

.6666667 

.0000000 

.6666667 

.0000000 

10 

A 

P 

.9999998 

.9978070 

.7634730 

.9950000 


A 

c 

.76983072-01 

.76923072-02 

.6076923 

.30769232-01 

11 

A 

P 

.9999998 

.9999652 

.7120360 

,9934446 


A 

c 

.59808600-02 

.35885170-02 

.6124402 

.13157890-01 

12 


.9999999 

.9992025 

.5198413 

.0000000 


A 

c 

.94339620-02 

.47I698IB-O2 

.5707547 

.0000000 

13 

A 

P 

.9999999 

.9878049 

.5378705 

.9967664 


A 

c 

.94594580-02 

.13513512-02 

.5689189 

.54054040- 02 

l4 

A 

P 

.9999996 

.9999225 

.7606132 

.9961320 


A 

c 

.18154310-01 

.19667172-01 

.6142209 

.25718612-01 
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Table 2(a) Cont’d. 


15 

A 

P 

.9999981 

.9Q99U80 

.7990319 

.9978045 


A 

c 

.3760163F-01 

.6402U39E-01 

.6117886 

.1626016E-OI 

l6 

A 

P 

.999999h 

.9999822 

.8977956 

.0000000 


A 

c 

. 3200000 

.1333333E-01 

.6800000 

.0000000 

17 

A 

P 

.9999996 

.9998726 

.7000611 

.9971001 


A 

c 

.2729528E-OI 

.1736973E-OI 

.6091812 

.7444169E-02 

18 

A 

P 

.9999998 

.0000000 

.6358382 

.9915493 


A 

c 

.26905831;- 01 

.0000000 

. 5650224 

.1345291E-01 

19 

A 

P 

.9999995 

.9999472 

.7892157 

.9981061 


A 

c 

.iv699739H:-01 

.1566580E-01 

.5613577 

.3655352s- 01 

20 

A 

P 

.9999985 

.9999343 

.8945498 

.9974043 


A 

c 

.1307693 

.1461539 

.6846154 

.3076923s- 01 

21 

A 

P 

.9999991 

.999899^+ 

.6274128 

.9959416 


A 

c 

. 3273322 S-02 

. 409165^1 E- 02 

.5739225 

.8728858E-02 

22 

A 

P 

.9999973 

.9998945 

.8599168 

.9978881 


A 

c 

.89820381;- 01 

.6586826E-OI 

.6047904 

,5988024e-01 

23 

A 

P 

.9999995 

.9999318 

,6606772 

.9960402 


A 

c 

.3120125E-02 

.5512219E-02 

.5951118 

.8840352E-02 

2 h 

A 

P 

.9999992 

.9999160 

.8674912 

.9953895 


A 

c 

.29it53022-01 

.2805049E-OI 

.7363254 

.5329593B-01 

25 

A 

P 

.9999992 

.9999613 

.7558528 

.ooooooo 


A 

c 

.3OI8IO9E-OI 

.IOO6036E-OI 

.5875251 

.ooooooo 

26 

A 

P 

.9999982 

.9999579 

.7334171 

.9961382 


A 

c 

. 2268431 R- 01 

.4190296B-OI 

,6014493 

.1102709E-01 

27 

A 

P 

.9999985 

.9999592 

.7450024 

.9939882 


A 

c 

. 198793 OE- 01 

.1668442E-01 

.5750799 

.2O94427E-OI 

28 

A 

P 

.9999992 

.9999778 

.5950413 

.9983525 


A 

c 

.I7O6037E-OI 

.1181103B-01 

.5787402 

.I312336E-O2 

29 

A 

P 

.9999997 

.9999112 

.6872549 

,9956168 


A 

c 

. 17175 57 E- 01 

.9541985s- 02 

.6087787 

.I335878E-OI 
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Table 2 (b). Raw estimates of P and C 


for the 50 kbps wide-band data. 


1 

A 

P 

. 9999995 E + 00 

. 99993422+00 

. 60417465 X -00 

. 9922753 E +00 


c 

. 8959681 E -02 

.8461922 5 ;- 02 

.51916385x00 

.547536IE-02 

2 

A 

P 

.999999715,. 00 

. 9999660 00 

.613483IE+OO 

.99132062+00 


A 

c 

.6868917s- 02 

.1717230E-02 

. 4922725 E +00 

. 8013736 B- 02 

3 

A 

P 

.99999983+00 

.9999250E+00 

. 60000005 X -00 

.9951378E+OO 


A 

c 

.II30U35S-02 

.2608696s- 02 

.U434783E+OO 

.26086962-02 

k 

A 

P 

• 9999999 S +00 

.OOOOOOOE +00 

. 81304355 X -00 

.96774195x00 


C 

.i^ 59770 is -01 

.OOOOOOOE +00 

. 49425295 X -00 

. 11494252-01 

5 

A 

P 

. 99999993+00 

. 00000002+00 

.6063218E+00 

.OOOOOOOE+OO 


A 

c 

.192307753-01 

.OOOOOOOPJ+OO 

. 52692 31E+00 

. 00000005 X 00 

6 

A 

P 

. 9999999 EH 0 O 

.9999IOOS+OO 

.47685195x00 

. 9 S 13084 e +00 


A 

c 

. 13100442-01 

.43668IIE-O2 

.J+934498E+00 

.87336232-02 

7 

A 

P 

. 99999993+00 

. 00000005^^00 

. 58695652+00 

.97402605x00 


A 

c 

.225 9887s- 01 

. 00000005 >f 00 

. 4293785 S +00 

. 1129944 b- 01 

8 

A 

P 

.99999993+00 

.OOOOOOOE+OO 

. 61933545>+00 • 

.OOOOOOOE+OO 


A 

c 

.1716738s- 01 

. 00000005 X 00 

. 5407726 E+ 00 

.OOOOOOOEHOO 

9 

A 

P 

. 99999993+00 

. 00000002+00 

.5322581E+00 

. 9929 O 78 EHO 0 


A 

c 

.20161292-01 

.OOOOOOOE+OO 

. 4677419 E +00 

.40322582-02 

10 

A 

P 

• 9999998 E+ 00 

.99995953+00 

. 5989176 E+ 00 

.996299711x00 


A 

c 

.97323622-02 

.8IIO3OOB-O2 

.5409570E+00 

. 3244120 E -02 

11 

A 

P 

. 9999998 E+ 00 

• 9999375 E +00 

. 56384185 X 00 

.99608062+00 


A 

c 

,5 791504 s - 02 

. 51480052-02 

.'49678255X-OO 

.45045062-02 

12 

p 

. 9999998 E+ 00 

.9998188E+00 

. 8231986 E+ 00 

.99366685x00 


A 

c 

.657765OB-O3 

.3617707E-02 

. 65348955 X -00 

. 334 i 446 e -01 
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If wre write the model transition matrix as 



then the probability of getting the particular pattern of errors observed on the 
channel P(M, z) is given by: 


W -1 

p(M, z) = P u(je)u(L) n (10) 

>1 ^ 


where = theoretical bit error- rate. 

A = number of the error-free bits before the first error 
in the run. 

L = number of the error-free bits after the last error in the run, 

and here 

i. = length of gap> j = 1, IT -1. It is desired to maximize P(M, z) 
J e 

subject to some restrictions. It is easy to show that: 


where 



1 

A 

c 


c 


1 + 


s 

i=l 



S 
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> 


k S 0 


(11) 


U(k) = 


k— 1 

2 “A 

1-1 


and 


V(k) s E c^(i - P^)pJ"^ 
i=l 


The set p^, c^, i = 1, 4, maximizing P(M, z) also maximize 


K -1 
e 


1,-1 


log = log E c^pf”^ + log c.p^ ^ + S log E C.(l-p. )p,^ (12) 

?! i^^ i^^ j=l j^iii 


is given by 




- £41 

c.p. 
1^1 


C.p" 

1^1 


- L+1 N -1 - Jd 

^ ‘ - s 1. ^ ‘ 


P. = 




C.pf c.pf'l 

(ji+1) TT- ^ -r r - + -> ■ --■ ^ — + (irU) 


- L 


c.p?^" c . pf J* 

^ ^ +S(£,41) ^ ^ 


/~ ~ A 4 =-^ 4 L4i; 7- 4 4 ZJ^Z.+l) 

(i-p.)\ ii-v^)\ J 


and if 


c.p^ 

1^1 




c. - 




c.p^J 

11 


(Ng4l)((£4l) 


C,P^ c.pf-1 
i 1 11 


t: v^l t; rM^ N -l - 

c.p. c.p. e c.p." 

11 . 11 . T' 1 1 


4- +(l4l) pv- A - -H E(je,+1)-^ 

'"-f'i*"! (1-Pi)\ (1-Pi)\ j=l " "d 


then 


c. = 


— t t 

C.p. 
I X 


1-p. 
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where 


c, 

X 


Oi(l-Pi) 


Pi 


♦ ry 

* 1 


- je 
4 c.p. 
T' 

• 1 1“P. ' 
1-1 




k 

s 

i=l 


11 

i-p. 


(13) 


and 


O'. 


4 

i=l 


c.p.*^ 

11 


t f' A ^ 

To obtain the estimates p. and c. the raw est5.mates p. and c. in 

(9) are used in (13) as first approximations for p^ and c^. Using the 
and thus obtained as initial estimates for p^ and c^, the above procedure 

is repeated on a digital computer to give a new set of maximizing parameters, 

II II 

and p^. This iterative method is repeated until a degree of stability suffi- 
cient for curve fitting purposes is achieved. See Baum and Welch [7] for further 
details of this iterative method. 

In general, note that it is possible for distinct transition matrices M 
to yield the same z-process and thus the same P(M, z). Let us call all such 
matrices equivalent. For example, as shown by Blackwell and Koopraans [8], any 

two matrices M^, yielding the same V(k) are equivalent. It suffices for 

our purpose therefore to find any one member M in this equivalence class, i.e., 
any transitions matrix M yielding a critical point of P(M, z). 

Starting with the raw estimates in (7), two-hundred iterations on the 
computer yield the maximizing parameters shown in Table 3(a) for the 4,8 kbps 
data and Table 3(b) for the 50 kbps data. 

(jy) Curve Fitting and Goodness- of- Fit Test 

A basic statistic in our model is the gap distribution V(k) because the 
process renews itself each time it reaches state B. In other words the 
occi.u?rence of an error is the renewal event which wipes out the memory of the 
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Table 3(a) Maximum likelihood estimates (MLE) of P and £ 

for the 4.8 kbps HF dateline 


1 

P' 

.9999953950E+00 

.9968072943E+00 

.5717325689E+00 

.91 06591 544E+00 


c' 

.8981760809E-02 

. 6745001 194E-01 

. 420331 7556E+00 

.1713520791 E+00 

2 

P' 

.9999995475E+00 

.999991 7204E+00 

.761 9005941 E+00 

.OOOOOOOOOOE+OO 


c' 

.1047399680E+00 

.1 030998001 E-01 

.6742438692E+00 

.OOOOOOOOOOE+OO 

3 

p' 

.9999998554E+00 

.9999535030E+00 

.5459808331 E+00 

.8961057897E+00 


c’ 

.1299700189E-01 

.1017617426E-01 

.5091828334E+00 

.3962966780E-01 

4 

p' 

.9999986050E+00 

.9999567087E+00 

.5519996996E+00 

.922781 1211 E+00 


t 

c 

.2665215750E-01 

.1285778017E-01 

.4786566726E+00 

.8614839406E-01 

5 

p' 

. 999998431 3E+00 

. 99991 98994E+00 

.5999923080E+00 

.9601877004E+00 


c’ 

.5125120746E-01 

.2359833744E-01 

. 46271 44092E+00 

.1478188950E+00 

6 

p' 

.9999997989E+00 

.9999390274E+00 

.542501 2681 E+00 

.91 0650381 8E+00 


c' 

.1905607045E-01 

.2137889279E-01 

.4755474738E+00 

.7556629062E-01 

7 

p’ 

.9999993455E+00 

.99992821 91 E+00 

.6780365331 E+00 

. 997271 4645E+00 


c' 

.1504603869E-01 

.1804396336E-01 

.6435550889E+00 

.1771507139E-01 

8 

p' 

. 99999981 85E+00 

.9997777749E+00 

.7532956976E+00 

.9992440385E+00 


c’ 

. 59891 27950E-01 

.3638922970E-01 

.6622567025E+00 

.2454708225E-01 

9 

p’ 

.9999999229E+00 

.OOOOOOOOOOE+00 

.91 3043371 8E+00 

.OOOOOOOOOOE+OO 


c' 

.750001 9813E+00 

.OOOOOOOOOOE+00 

. 45651 98227E+00 

.OOOOOOOOOOE+OO 

10 

p' 

•9999998558E+00 

.9979309774E+00 

.5581 9591 lOE+00 

.8535666459E+00 


c' 

.7636851511E-01 

.4016395427E-01 

. 320071 0332E+00 

.2645899237E+00 

n 

p' 

. 999999901 OE+00 

.999931 5091 E+OO 

.5970954356E+00 

. 9689291 927E+00 


c' 

.6033735685E-02 

.7273912417E-02 

.5537134181E+00 

.5750297725E-01 

12 

p' 

.9999999537E+00 

.OOOOOOOOOOE+00 

.5454545438E+00 

.OOOOOOOOOOE+OO 


c' 

.1408451107E-01 

. OOOOOOOOOOE+OO 

.5377720829E+00 

.OOOOOOOOOOE+OO 

13 

p' 

.9999999219E+00 

. 90061 92752E+00 

.5363738531 E+00 

.9979544427E+00 


c' 

. 94481 04969E-02 

. 97 56278883 E- 02 

.521 61 92007 E+00 

.7212404850E-02 

14 

p' 

.9999995490E+00 

.9992407306E+00 

.61 63 191 023 E+00 

.9081214215E+00 


c' 

.2258593837E-01 

.3780571254E-01 

.4839834401 E+00 

.1401 222301 E+00 
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Table 3(a) Cont'd 


15 

P’ 

. 9999981 532E+00 

. 9999391 884E+00 

.7730621732E+00 

. 99871 74063E+00 


c' 

,4522522687E-01 

.5680939256E-01 

.6806276662E+00 

.1 75086621 4E-01 

16 

P' 

.9999996082E+00 

. 9999983751 E+00 

.9072722727E+00 

.OOOOOOOOOOE+00 


c' 

.2151236379E+00 

.1138326308E+00 

.60881 91 268E+00 

.OOOOOOOOOOE+00 

17 

p' 

.9999996682E+00 

.99991 15533E+00 

.6802853951 E+00 

.9991929634E+00 


c' 

.2775160346E-01 

.1144913912E-01 

.6442882584E+00 

.1370192609E-01 

18 

p' 

.9999998824E+00, 

.OOOOOOOOOOE+00 

.5245536494E+00 

.9819256259E+00 


c' 

.2679576549E-01 

.OOOOOOOOOOE+00 

.4839356004E+00 

.4972253088E-01 

19 

p' 

,9999994394E+00 

. 999022837 OE+00 

.5303564611 E+00 

.9310260595E+00 


c' 

. 548525731 5E-01 

.4249021 31 lE-01 

. 4000401 305E+00 

.1 38099311 2E+00 

20 

p’ 

.9999986382E+00 

.9999380983E+00 

.9033324592E+00 

.9993548564E+00 


c' 

.1476422873E+00 

.91 98748428E-01 

.6266307466E+00 

.6663347228E-01 

21 

p’ 

•9999990224E+00 

. 999661 6399E+00 

.5619055084E+00 

. 964091 3984E+00 


c' 

.4247087939E-02 

. 760254251 5E-02 

. 53968371 77E+00 

.2670050566E-01 

22 

p’ 

.9999978277E+00 

.9998756379E+00 

.84631 96387E+00 

.9990676567E+00 


c' 

.1028148355E+00 

.2779428687E-01 

.6637432406E+00 

.8503767638E-01 

23 

p' 

.9999992919E+00 

.9992437394E+00 

. 57044381 90E+00 

. 92662491 33E+00 


c' 

.4881179917E-02 

.1105176014E-01 

.5328776355E+00 

.4625063032E-01 

24 

p' 

. 999999121 lE+00 

.9991934723E+00 

.8367628756E+00 

.9732659686E+00 


c' 

.3813345289E-01 

.4954637321 E-01 

.6831363057E+00 

.9331328594E-01 

25 

p' 

. 9999991 663E+00 

.9974426078E+00 

.6732643675E+00 

.OOOOOOOOOOE+00 


c' 

.3647266453E-01 

. 18981 80839E-01 

.6358960597E+00 

.OOOOOOOOOOE+00 

26 

p' 

.9999980558E+00 

.9999385496E+00 

.57638601 01 E+00 

.9029800511 E+00 


c' 

.2903556659E-01 

.4430774668E-01 

. 449741641 5E+00 

.1321740257E+00 

27 

p’ 

.9999982226E+00 

.9996096551 E+00 

. 5650661 490E+00 

.9504853272E+00 


c’ 

. 27921 14797E-01 

.1840825642E-01 

.4777731 974E+00 

.1 02791 2530E+00 

28 

p' 

. 999999281 7E+00 

•9999792257E+00 

.5938876271 E+00 

.9991027533E+00 


c’ 

•2008279598E-01 

.8168327315E-02 

.5756273053E+00 

. 249355241 5E-02 

29 

p' 

.9999997544E+00 

. 99991 28595E+00 

.64381 81 406 E+00 

.9932046896E+00 


c ' 

.1733256091E-01 

.9945088923E-02 

.6098647740E+00 

.2528598892E-01 
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Table 3(b) MLE of P and C for the 50 kbps W-B dateline 


1 

p’ 

.99999951+41E+00 

.99990046273+00 

. 53386847743+00 

.96762983963+00 


c' 

. 98 U 5264894»02 

.87696955173-02 

.51290890593+00 

. 19975717243-01 

2 

P' 

• 9999997 ^ 153+00 

.99717745333+00 

.4894906586 EM-00 

.90333944193+00 


1 

c 

.745456197^3-02 

.965384697OE-02 

. 46140245833+00 

.36356493593-01 

3 

p' 

. 9999998071 E+00 

.99904472773+00 

.43760027433+00 

, 9044 ? 34152 s +00 


c' 

.12176858233-01 

.42903374443-02 

. 41246139683+00 

.37061543573-01 

U 

p' 

.9999999611 E+00 

. 00000000003+00 

. 48797137083+00 

.94370021503+00 


c’ 

.45454826783-01 

. 00000000003+00 

. 38548665643+00 

. 1553020579 E >+00 

5 

P' 

• 99999991323+00 

. OOOOOOOOOOE +00 

. 54721635443+00 

, 00000000003+00 


c' 

.22978554663-01 

. 00000000003+00 

. 53464211243+00 

.OOOOOOOOOOB+OO 

6 

p' 

.99999997583+00 

.99991661303+00 

.47257513403+00 

. 9852605868 EM -00 


c' 

.13053833153-01 

. 44172445103-02 

.45762369823+00 

.13957963193-01 

7 

p' 

• 99999995523+00 

. 00000000003+00 

.47057099623+00 

.97700237593+00 


c' 

. 22472104263-01 

. 00000000003+00 

. 4471357312 E}+ 00 

. 26701140013-01 

8 

p' 

.99999991213+00 

. 00000000003+00 

.53425511693+00 

. 00000000003+00 


c' 

. 25184033443-01 

. 00000000003+00 

. 52080041703+00 

. 00000000003+00 

9 

p' 

.99999994163+00 

. 00000000003+00 

.47400119593+00 

.98529362943+00 


c' 

.20080787153-01 

. 00000000003+00 

.45823022673+00 

.12997219683-01 

10 

p' 

. 9999998113 J 1+00 

. 99994818483+00 

.52074920633+00 

.865389958OE+OO 


c' 

.98905317543-02 

.10614189183-01 

.4790819180E+00 

.51498356793-01 

11 

p' 

. 9999999024 iH 00 

.99998546103+00 

. 50991464773+00 

.99656541033+00 


c' 

.39656021113-02 

.52680779503-02 

.49908513723+00 

.11962902233-01 

12 

p' 

•99999981593+00 

.99968247573+00 

. 74994 12224 EM - 00 

.98727099993+00 


c' 

. 75595489973-03 

.63044453213-02 

.68379623563+00 

.80105021773-01 
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past gap. Each gap is then an independent statistical sample from the distribu- 
tion V(k), k ^ 0, The occurrence of bursts is a direct consequence of the 
form of the gap distribution. We shall therefore assess the performance of our 
model by how good a fit U(k), k ^ 0, a function of the gap distribution, 

A 

gives to the empirical U(k) - both from the wideband and high frequency data. 
(^Recall from (ll)that 


s c.p?“^ = u{k) 

L=1 

« s V(j)) 


Figures 4, 5, and 6 are representative of fits obtained from the Green, Amber 
and Red groups respectively. 

For purposes of error control it is more important to have very accurate 
prediction of error clusters when the gaps are short (high bit error rate) than 
dirring long intervals of error- free transmissions. This is why we have com- 
pared U(k) and U{k) for 0 k ^ 4000. 

Let F(k) and defined by (l4)be the distribvition functions 

associated respectively with V(k) and V(k). 


k-1 

F(k) = S V(j) 
3=0 

F (k) - i V(o) 


(14) 


Then 
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|r(k) - F^(k)i = 


on 


1 - S v(j) 

j=H 



2 v(j) 


= |u(k) - u(k)[ 


Hence 


" ' c«SSoool"<'’’ - "(''’I 

is the maximum absolute difference betv/een the model and measured (empirical) 
distribution fijtnctions in the range shown. For the Green group (Figure 4) 
p =«035j for Mber (Figure 5) p = .015; and for the Red (Figure 6) p = .012, 
which shows that the higher the bit error rate the smaller the p (the better 
the fit). This is seen from the respective graphs. 

Now write 

Y(k) = U(k) ~ U(k); Os)? UOOO . 

For a very good fit one would expect Y(k) to have very small mean, Y, and 
mean- square- error , s . A ?.ero mean would indicate that the U(k) is symmetrical 

y 

about U(K) and the small standard error as a measure of the deviation of U(K) 
from U(k} indicates that the U(k) does not deviate too vridely from U(k)* For 
the Green group Y = - -007> s - *003; for the Mber Y " - ,004, s = .002 

y y 

and for the Red Y ~ - .001, - .0008. p and Y for all the 29 error- runs 

are shov/n in Table 4(a) for the HF circuit and in Table 4(b) for the wide-band . 
Because of the wide range of error rates observed in the sample runs 

_ '3 

(0 - lO""^) it is clear that there is no way of constructing a single channel with 
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N INDIVIDUAL RUNS GROUPED RUNS OVERALL CHANNEL 



P 

Y 


P 

Y 

=y 

P 

Y 


2464200 

.022 

- .0003 

.002 

.063 

.003 

.006 

.107 

-.015 

.008 

14430000 

.012 

-.001 

.0008 

.083 

-.008 

.007 

.098 

-.22 

.007 

63632400 

.023 

-.0005 

.001 

.076 

-.003 

.006 

.091 

-.022 

.006 

28970400 

.049 

-.005 

.002 

.120 

-.036 

.005 

.101 

-.02 

.007 

48342000 

• 029 

-.003 

.001 

• O55 

-.017 

.003 

. o4o 

.006 

.007 

5500800 

.015 

-.004 

.003 

.063 

.015 

.004 

.079 

.039 

.005 

51313200 

.023 

-.005 

.002 

.063 

-.020 

.004 

.058 

.004 

.006 

61706400 

.015 

-.0008 

.002 

.056 

-.022 

.002 

.021 

.002 

.004 

30902400 

.025 

-.004 

.002 

.081 

-.044 

.003 

.065 

-.021 

.006 

64060800 

.018 

-.001 

.0006 

.145 

0 

1 

.005 

.130 

-.021 

.008 

28754400 

.022 

-.001 

.003 

.059 

-.020 

.005 

.034 

.004 

.003 

49009200 

.009 

-.0005 

.004 

.135 

.045 

.006 

.151 

.067 

.004 

61592400 

.OU 

-.001 

.002 

.048 

-.013 

.002 

.033 

.011 

.004 

34022400 

.027 

-.002 

.003 

.068 

.010 

.009 

.054 

.034 

.005 

12943200 

.017 

-.005 

.009 

.670 

.195 

.030 

.686 

.219 

.029 

7084800 

.027 

-.005 

.008 

.336 

.089 

.025 

.352 

.113 

.021 

28753200 

.029 

-.002 

.003 

.035 

-.001 

.018 

.120 

.022 

.016 

19392000 

.073 

-.002 

.004 

.050 

-.017 

.002 

.035 

.007 

.006 

48927600 

.031 

-.006 

.002 

.033 

.009 

.002 

.059 

.033 

.004 

43287600 

.024 

-.002 

.002 

.035 

-.018 

.002 

.023 

.006 

.004 

19036800 

. 023 

-.001 

.001 

.118 

-.026 

.005 

.103 

CVJ 

0 

0 

1 

.008 

28974000 

. 030 

-.001 

.002 

.067 

-.028 

.003 

.042 

-.004 

.005 

21888000 

.012 

-.008 

.004 

.033 

.016 

.003 

.119 

.078 

.008 

55360800 

.035 

-.006 

.007 

.038 

-.006 

.016 

.126 

.065 

.011 

12952800 

.514 

-.416 

.013 

.721 

.243 

.021 

.770 

.305 

.022 

55497600 

.025 

-.007 

.003 

.037 

-.017 

.008 

.109 

.045 

.004 

21592800 

.045 

-.007 

.003 

.200 

00 

0 

• 

1 

.006 

.147 

-.022 

.008 

42288000 

.025 

-.008 

.003 

.654 

.229 

.019 

.704 

.292 

.020 

340272000 

.035 

-.004 

.002 

.167 

-.069 

.004 

.129 

-.007 

.007 


Table 4(a). Curve fitting parameters for 4.8 kbps hf dataiine 

Y = U ( k ) - 0(k) 

Y = Mean of Y , Sy = standard error of Y 
p = Max | U ( k ) - U ( k )| 

0 < k < 4000 
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N 

INDIVIDUAL RUNS 

OVERALL CHANNEL 



P 

y 

s 

y 

P 

Y 

S 

y 

39198003 

0.0046 

-0.0007 

0.0004 

0.118 

0.0079 

0.0072 

42681997 

0.0053 

-0.0006 

0.0008 

0.133 

-0.0009 

0.0071 

62300445 

0.0079 

-0.0008 

0.0005 

0.188 

0.0038 

0.0084 

51451345 

0.0289 

-0.011 

0.001 

0.031 

0.0265 

0.0042 

46043171 

0.022 

-0.008 

o.(X)o8 

0.144 

0.0068 

0.0082 

41468126 

0.015 

- 0.004 

0.0008 

0.167 

0.0046 

0.0085 

44651211 

0.0247 

-0.006 

0.0008 

0.162 

0.0086 

0.007B 

44301174 

0.046 

. - 0.012 

0.001 

0.153 

0.0043 

0.0082 

51389935 

0.011 

- 0.004 

0.0006 

0.169 

0.0077 

0.0086 

54343756 

0.008 

-0.002 

0.0004 

0.119 

0.0085 

0.0074 

1 ^ 344 l 434 

0.0195 

-0.0007 

0.001 

0.155 

0.00064 

0.0078 

52020411 

0.079 

- 0.001 

0.004 

0.122 

-0.0034 

0.0051 


Table 4(b). Curve fitting parameters for 50 kbps W-B dateline 
y = U(k) - U(k) 

Y - mean of Y, S = standard error of Y 

y 

I - max iu(k) - U(k)| 

o<k< 4 ooo 
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transition matrix M which will give a good fit to each of the sample U(k) , 
k s 0. One thing that can be done is to construct a channel for each of the 
Green, j^ber and Red groups. By plotting averaged model U(k) for each of 
these groups we shall get a better picture of what is happening at comparable 
error rates. As raw estimates for the optimum averaged parameter set for each 
group we use weighted averages of the parameters and c!^ which gives the 

critical points of P(M, z) in each of the samples in that group, weighted 
according to the number of bits transmitted in the sample run. For example, if 

N. represents the mmiber of bits transmitted in sample j, j running over the 

J 

number of samples in that group, then 


2n. 


I 

J J-d 




W 


is the averaged raw estimate of 
in sample j which maximizes 
Now let 


N = E N. ; i = 1, 4 
J ^ 


for the group and 
P(M, z). 



(16) 


is the estimate of 


- 

V' ^i^i 

- \ 

c.p. 

^ 11 

\ i 


jk ^ 1^1 


(IT) 
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where k=l, 


• • * 


runs over the samples in the group and 


J=l, is the number of errors in sample k. 

= number of the error- free bits before the first error in sample K. 

= number of the error- free bits after last error in sample k. 

. .th 


''jk 


gap length in sample k. 


Then if 


k.2l 


7 r. Ik - ^ N . -1 - ^ jk 

c*p. c.p, ek c.p. 

11 11 11 

J.1 “jk 


Dp = S 
k^l 


- ^Ik - _^lk*^ 

■ ■ -■ II -f 




C.p. 




Si., " " 


■i' "'Ik 


i^“Lj^ (l-p.)"o^^ j -1 °'jk 


(18) 


D = Dj_ H- Dp , 


the optimum averaged parameter set for that group is given by: 



(19) 


Our notation here is as was used in (9), (lO), and(ll). n is the number of 
samples in the group. Two hundred iterations on the computer of expression ( 19 ) 


JPL Technical Memorandum 33-699 


41 



Table 5a MLE of P and C for the Red, Amber and Green groups 


and overalX channel; U,8 kbps dataline 


MT.'R (Optimum) Set for Group R (10 ) (Red) 

P .9999982928EH-00 .9975858132S+00 

c . 69)4.7437705 E- 02 .38114633335-01 


. 9114607347E+00 . 5645880886 Et 00 

. 11032 52624e+ 00 . 47075548135+ 00 


-4x 

MI)E for Group A (Amber 10 ) 

p .9999991048E+00 .99987335435+00 

c ,30385934885-01 .28864553155-01 


.927852762OE+OO .5752459153£H-00 

. 1214899946e+ 00 . 4658393836.5+00 


MLE for Group G (Green 10 ) 

p ,99999969625+00 .91651019855+00 

c .91481753775-01 .14505589255+00 


,5635646054e+00 .5635646054e+00 

. 97966495095- 01 . 32 543278075+ 00 


Single Channel (Parameters) for 4.8 kbps Data 


. 9999989701 E+OO 
.2346445968E-01 


.9987625 5065+00 . 91348638125+ 00 

.29379872885-01 .12110684915+00 


.5668315129E+00 

.46171997865+00 


Table 5b MLE of P and C for 50 kbps Channel; 

for k = 6 


P 

c 


•9999997795E+00 

.62265226945-02 


.90271800235+00 

.20277549555+00 


. 9967973995 E+00 
.25311770755-01 


.51389109375+00 

.38220786355+00 
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log u(k) 



Fig, 4, Gap distribution for Green error group 
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log u(l<) 


0 



Fig, 5 * Gap distribution for Amber error group 
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log u(k) 



Fig, 6, Gap distribution for Red error group 
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log u(k) 



-4 

Fig, 7, Gap distribution (30 kbps circuit, BER = 0^51 x 10 ) 
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starting with raw estimates in (l6) yield the stable parameter set for the 4.8 
kbps and 50 kbps shown in Tables 5a and 5b respectively. 

Figures 4 throvigh 6 also show comparative plots of the function U(k) 
for each group and the overall channel. As expected the overall channel does 
not predict any particular group closely enough. On the other hand, the less the 
spread among the values of the empirical bit error rate in a group, the better 
the fit between the averaged U(k) and any given sample in that group. 

Let us go back to the definition of p in (15) and use it to construct 
confidence intervals for the distribution function F(k). What we want to do is 
to construct a statistical test of the hypothesis that the true parent distribu- 
tion G(x), X > 0 is F(x) against the alternative that c(x) f F(x) using the 
metric p at significance levels o> - 0.01, 0.05, 0.10 and 0.15* That is to say 
we shall teat the hypothesis 


: G(x) - F(x) (20) 

at levels ot given above. The empirical distribution -.function is Fjj(k). Since 
F(x) is continuous we shall use the Kolmogorov- Smirnov test statistic. 

1C sup |p(x) - F (x)| . (21) 

Now we shall confine the supremura in ( 21 ) to the range of sample points k for 

0 k <4000, Indeed we shall use equally spaced 2000 points in the range. Thus 

our n = 2000 and K then becomes: 

n 

K = ^ max |F(k) - F„(k)I (22) 

0^^000 

= /n p 
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It is clear that is to be rejected if is too large. The 

different rejection regions we shall use will be <y. Since no tables exist for 
n as large as 2000 we shall use the limiting form of the distribution of 
to get the cut-off points for the test. 

Let 

" P(/n p ^ X) . (23) 

Kolmogorov showed, (see Darling [11]), that 

f^„(X) - f^(X) 

= £ (-l)V^^^^^ . {2h) 

k=-<n 

Thus for large n, approximate confidence bands for F(x) are given by 

’ where 1 - ^(X^)>= 0 '. Since F^(k) converges to the true parent 
distribution G(k) with probability one (Cantelli-Gilivenko lemma), 
thus the probability that the maximum absolute difference between G(x) and F(x) 
is at most X^/n when n is large enough. This statement Is not true for small 
n (see the modification of ^(X) by the error term ^p(^) “ l)*^fj(X)“f!^(X) [ = 0(n ^^®) 
in K. Kunisawa, et al [9 1 large as ours) but tables available for 

small n do not cover values of n as large as 2000. Kunisawa, et al, further 
point out that for a ~ 0.01 or 0.05 and the corresponding values of X^, D^(X) 
are very small for n > 100. In any case the effect of the correction factor D^(X), 
when ^(X) is used rather than jlS^(X), is to increase the rejection region fy 
thus forcing us to reject certain distributions F(x) which would have been 
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accepted as not essentially different from the true G(x) had we used |ii^(X). 


Hence the use of the limiting 

distribution 

^(X) makes the test of the hypothesis 

«o 

very conservative. 





Using tables of ^(X) 

by Kunisawa, 

et al [10] 

the metric p was calculated 

for 

the different values of « and corresponding X^ 

( see Table 5 ( c ) ) . 


Table 5 ( 

c ) . Kolmogorov- Smirnov 

Test Statistic 


a 


X 

a 

P 


0.01 

0.99 

1.628 

0.0364 


0.05 

0.95 

1.3581 

0,0304 


0.10 

0.90 

1.224 

0.0274 


0.15 

0.85 

1.138 

o'. 025 5 


We now compare the values of p with those listed in Table 4a obtained from 
each of the test runs using equation (15) and notice the following: 

1. At level a = 0.01, almost all the test runs agree that the distribution 
function predicted for each of them by the model is the true distribution. 

In other words with probability the maximum absolute difference between 

G(x) and F(x), p, is no greater than 0.0364. (Equivalently put, the error of 
our estimate is at most In the Red group each p is less than this 

value showing that at level o = O.Oi, F(x) given by the model is accepted 
as the true parent distribution. Two of the 19 test runs in the Amber group 
fail our test only slightly while also two of the seven tests in the Green 
group reject the distributions our model assigns to them. The encouraging 
fact is that the higher the bit error rate the smaller the p and hence the 
more we are wont to accept the hypothesis Remember the emphasis in this 

study is to model for very short bursts (associated with high bit error rates) 
that cause decoding errors. 
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Even if we allow as much as 15% (a = 0.15) probability against accepting 
Hp, every one of the test runs in the Red group still accepts the model 
F(x) as the true distribution. This fact gives us confidence to use our 
model for predicting significant error patterns in the worst mode of the 
channel. Notice also that we obtain good results in the Amber group. 

The grouped runs and the overall channel, however, show poor agreement as 
we have noted earlier. This is as should be expected owing to the wide 
range of error rates obtained on the channel (0 - 10 ) . But the fact that 

only ten of the test runs show errors of less than 67. indicates that the 
error patterns on the GCF do not follow a single distribution. The 
overall channel predicts the Amber error mode better than any other. 

For example, no run in the Red group shows less than 9% error while the 
Green group contains runs with as much as 77% disagreement with the averaged 
channel. Any statistics calculated using a single parameter set will there- 
fore not be valid for the different error modes and hence not reliable. 

The conclusions for the 50 kbps data are even more striking. At level 
Q^ = 0,01 all the tests except one accept the model distribution as the 
true distribution (see Table 4(b)). 
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Section III 


AUTOCORRELATION OF BIT ERRORS AND CHANISIEL MEMORY 

In the last section we presented the Maximum Likelihood Method for 
obtaining the estimates of the model parameters and demonstrated how closely 
our model predicts the pattern of errors on the GCF. While we consider 
the gap distribution or the distribution of return times to the bad state, 

V(k), k a 1, to be of fundamental importance in our model, there are func- 
tions of this distribution which play a central role in our understanding of 
the error patterns on the channel. Certainly knowledge of significant error 
patterns is necessary for error correction. The way the bit errors end error 
blocks are correlated should be known. In this section bit error correlation 
will be discussed. 

Denote by r(k), k 5 1, the auto- correlation of bit errors. That is 

r(k) = = l|Zo = 1); k ^ 1 . (25) 

By definition r(k) is the probability of having an error at time k following 
a given initial error. Let 

G.(k) = P(sj^ = Cx^|s^ = B); i - 1, 4 
B(k) = P(Sj^ - Bis^ = B) . 

Then 

r(k) » B(k) . 

The following recursions are satisfied; 


(26) 


(27) 
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B(k 4 1) qB(k) 4 S (1 - p.)G.(k) 

• 1 


G.(k 4 1) = c.B(k) 4 p,G.(k) 
1 ' 1 ' ' Ml'' 


i =• 1, 4 . 


Equivalently, 


(Gj(k4l), Gp(k4l), G3(k4l), Gj^(k4l), B(k4l)) = (G^(k), G2(k), G^Ck), G^(k), B(k))M 

(28) 


where 


M = 



0 

p 

'2 

0 

0 

c. 


0 

0 

^3 

0 “ 

r 


0 

0 

0 


I’P. 

I-P3 

1-Pi. 


is the transition matrix. Using the initial conditions 


G^(0) =0, i = 1, 4; B(0) - 1 


we can write (28) as 

(G^(k4l), Gg(k4-1), G2(K41), Gi^(k4l), B(k4l)) = (0, 0, 0, 0, (29) 

Using the method outlined in Appendix II, for some selected values of k, 
the matrix ^ was found. For example, for k = 5, and for each of the error 
groups and the overall channels (both for the 4,8 and the 50 kbps data), is as 
shown in Table 6. Above method is not the only one available for finding the 
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autocorrelation r(k). That method is developed specifically for use in 
Section V for finding the autocorrelation of error-blocks and is presented 
here only for comparison with the following more directly programmable method . 
By definition 


r(k) = P{0^1 — k-J-2— ijl}; j = 0, k - 1 


(30) 


The (k-j-l) bits indicated are 'any (k-j-1) binary digits. Hence we can write 


k-1 

r(k) - 2 P(0'^l|l)p{0"l— k-rn-j-3^lil]; m-0, k-j-2 
j-0 


that is 


r(k) - S V(j)r(k-j-l) (31) 

j>0 

subject to r(0) -= 1. Call the above methods, Methods I and II respectively. 

r(k), for k= 6 and for both methods, is shown in Table 7 for comparison. r(k), 

- 3 

for k = 1200, range from .0278 for bit error- rate P^^ ~ 10 “ to 0.0000033 

cz 

for P^ 10"' with value of 0.007 for the averaged (overall) channel. For the 
50 kbps data, r(l200) =- 0.0382; showing in each case that the memory of the 
channel is longer than 1200 bits . It is only in the Green group 
(Pj_ « 0.3258E-05) that the bit correlation, at 1200 bits apart, of 0.332889S-05 
is closest to Pj^ showing that in this group the memory is almost 1200*bits. 
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Table 6. for k - 6 

“3 

35rror-»rate 10 


.9999899 

.55E-06 

.140S-05 

.358s- 05 

.461B-05 

.0001U3 

.9863834 

.001973 

.00537 

,006463 

.004575 

.024998 

.6359246 

.1539447 

.1805568 

.01347 

.073556 

.17742 

.40545 

.33009 

.01874 

.10204 

.22498 

.35689 

.2973375 

-4 

Error- rate 10 





.9999948 

.215E-06 

,8068s- 06 

.182B-05 

.23OE-05 

.3204e-04 

.9992708 

.0001141 

.0002575 

.0003256 

.01626 

.015445 

.69563 

.12573 

.14693 

.05634 

.053509 

.19305 

.39236 

.30473 

.07814 

.07420 

.24742 

.334208 

.266023 

-5 

Error- rate 10 





■ .9999984 

.313E-06 

.124s- 06 

.412E-06 

.747E-06 

.0543 

.66724 

.02838 

.094289 

.15578 

. 16635 

.219696 

.084864 

.2818996 

.27923 

.16635 

.219696 

.08486 

.2818996 

.27923 

.22482 

.27038 

.06261 

.20799 

.23419 

Over-All Channel 

(4.8 kbps) 




9999940 

.255E-O6 

.91782-06 

,2089s- 05 

.2714e-05 \ 


f .0002447 

.9929 

.00110 

.0025 

.003247 


. 01494 

. 01866 

.6476 

.1459 

.17287 


i .04465 

.05576 

.19163 

.39096 

.31699 

V* 06182 

.07710 

.24198 

.33768 

.28122 / 

Over- All Channel (50 kbps Data) 





^.9999987 

.328E-fl6 

.475E-07 

.349s- 06 

• 5866 s - o 6'\ 


f .00444 

.66362 

.0179497 

.124576 

.1894 m 


. 0001697 

.004734 

.981624 

.00504 

.00843 


i .01255 

.3303 

.050667 

.293315 

.31321 j 

01656 

.394816 

.066642 

.246665 

.275713 y 
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Table 7. Autocorrelation r(k) for 1-6, 
High-Speed Circuit, 



Method I 

Method II 

Red 

.?97337 

.297337 

Amber 

.266023 

.266023 

Green 

.231+194 

.234876 

Overall Ch. 
(U.8 kbps) 

.28122 

.281232 

Overall Ch. 
(50 kbps) 

.275713 

.2757125 
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Section IV 


THE CHANNEL CAPACITY 


For estimating the maximal rate for which reliable transmission over the 
channel is possible we shall now calculate the capacity of the channel. 

Following Gilbert [4], the capacity, C, of the burst- noise channel 
is given by 


C = 1 - H 


(32) 


where 


H = lim S P( . . . > Vl I * 

z . = 0 or 1 


As shown in Appendix III, H can be written in terms of U(k), k ^ 0 


as 


K = - 


E U(k) 
k=0 


i u(k+l) 
) Ufk) 


log 


U(k4l) 


4 



(33) 


H is the entropy of the noise sequence z - Since 


U(k) 


k 

= S 

i=l 


k- 

c.p. 

11 


k-1 

for large values of k, u(k) c^^p^ where is the largest of the p’s. 

Thus ^ sufficiently large k =• k^ say. So that for all k ^ k^ 
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can approximate the surunand in (33) "by 




(3U) 


and H can then be written as 


H 




U{k4l) 

-Tf(^ 


Vi? 

k 

o 


Using the fact that 


T E U(k) 
k=k 

o 


E P(io^) 

k=ko 

V. -1 

= 1 - P E U(K) . 
■'■ k =--0 


k -1 

= 1 - E ?(10^) 

k-0 


We can approximate II by the finite sum: 


H 


k -1 
o 




k -1 
o 


1 - E U(k) 

^ k-0 


log 


U(k-fl) 

u(iO 



and thus 
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c = i- h«i-h' 


C was evaluated on the cOiT.puter for values of from UO to 1?00 in steps 

of 20 and for both the wide-band and the high frequency model parameters. In 
each case C converged (to six decimal places) for - 1000 while for many 
of the runs is very much smaller. 

Table 8 shows the model values of C for each of the 29 error runs 
of the U.B kbps high freq\iency data, the three groups Red, Mber and Green and 
for the overall channel. The lowest capacity is C = 0.99^'^ ('the C for a 
binary symmetric channel with the same bit error rate, C(BSC), is 0,996) and 
the highest C = 0.999998^+ (C(BSC) - 0.9999978). The capacity for the Red group 
is 0 . 9994 , Amber group C = 0.9999? » Green group C - 0 . 999988 and the overall 
channel capacity is 0 , 99988 , 

The capacity for the wide-band error runs i.s shown in Table 9. In 
this case C ranges from a minimum of 0.9993 to a maxinuun of 0.999997- The 
overall channel capacity is 0.99991 which is only 0.003 of 1 $ higher than the 
capacity of the average channel at high frequency. 

At 4.8 kbps during the highest bit error phase (discounting the outages) 
we can still transmit reliably at rates close to 0.9986 while the rate is 0,9993 
for the 50 kbps. During the Green phase (at 4.8 kbps) the worst we can achieve 
is 0.999983 while for the 50 kbps during the least bit error mode the worst is 

0.999992. 

We should remember however the recording problems, mentioned in [2j, 
encountered in the gathering of the wide-band date which have the effect of 
increasing the error- free gap lengths at the end of each test run. These have 
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the effect of lowering the bit error rate and thus increasing the capacity of 
the channel, based on wideband (50 kbps) data. 

The calculations based on the 4.8 kbps high frequency data are thus more 
reliable . 
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Table 8. High Speed (4.8 kbps) Channel Capacity 



Group 

Bit ^ Rate 

p 
- 1 
_i_ 

c 

C (GROUP) 

C(B5C) 

c(esc)/c 

1 

Red 

. 5062-03 

.998580 


.995653 

.997 


. 2414 - E- 03 

.229E-03 

.999576 

.999^ 

.997853 

.9983 

3 


.IU5F-O3 

.999716 


.9^)8576 

.999 

k 


. 1112-04 

.999978 


.99966 

.9999 

5 


. 915 E-Olt 

.99988 


, 99944 

.99956 

6 


.3032- Oh 

.9999 


.99965 

.99975 

''7 

( 


. 105 E "04 

.999977 


.99987 

. 9999 

8 


.43r-o4 

.9999 


.99952 

.99963 

Q 

y 

Ainber 

.l 6 hE- 0 h 

.999965 



.99984 

10 


.827F-09 

.9995185 

.99992 

.999895 

.99991 

11 

. 2931 '> 0 U 

. 99 E -04 

,99994 


.99976 

.99982 

19 


. 3932-04 

.99987 


.99956 

.99969 

13 


. 119 S -04 

.999969 



.99988 

Ik 


. 102 E -04 

.99997 


.9999 

.9999 

15 


.909R-09 

.99995 


.99989 

.99994 

16 

1 

1 

. 21 E -04 

.9999 


.99975 

.99985 

17 


.P 3 E -04 

.99991 


.99973 

.9998 

18 


• 229 E -04 

.99994 


.99973 

.9998 

19 


. 639 E -04 

.9998 


.9993 

.9995 

9.0 


. 635 F -04 

.9998 


.9993 

1 .9995 

21 


. 353 E -04 

.99993 


.9996 

.9997 

22 


.i4ie-04 

.9999? 


.99983 

.9999 

23 


,453s- 05 

.999983 


.999943 

.99996 

2k 

Green 

.303E-05 

.999989 


,999958 

.99997 

25 


.128s- 06 

,9999984 

.999988 


.9999994 

26 

• 332 E -05 

. 189 E -05 

.999993 


.99997 

.99998 

27 


.3293-05 

.999994 



.99996 

28 


.162E-05 

.999989 


.999977 


29 


.439s- 05 

.9999898 



.999952 


For the overall channel: = .U3 Be~0^, C - 0.999^^ 


C(BSC) = 0.9995, = 0.9996 
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Table 9 . 


Wideband (50 kbps) Channel Capacity 


Overall 

Channel 

Bit- Rate 
^1 

C 

C (Overall) 

C(BSC) 

C(BSC)/C 


, 46 ie-o 4 

.99992 


.99949 

.9996 


. 347 E -04 

.99994 


.9996 

.9997 


. 15 Bb-o 4 

.99997 


.99981 

.9993 

. 354 e -04 

. 856 b- o 6 

.999997 

.99991 

.999987 

.99999 


.38E-05 

.999992 


.999950 

.99996 


.185E-05 

.999997 


.99997 

.999977 

1 

. 199 E -05 

.999996 


.99997 

.999976 

1 

.349B-05 

.999993 


.999953 

.99996 


.29IB-05 

.9999946 


.99996 

.99997 


, 19 E- 0 U 

1 

.999965 

! 

.99988 

.99981 


. 24 Ue-o 4 

.99996 


.9997 

.99976 


. 242 B -03 

.9993 


.9977 

.998 


C = C(BSC) for overall parameter 0,9996 
C/c (Overall) 0.999T 
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Section V 


BLOCK-BIT STATISTICS 

A block o£ size n, (also referred to as a message) is defined as n con- 
secutive bits of data. For the GCF, n - 1200. In data transmission a block 
is considered as correct ^ in which case it is accepted, or incorrect, in which 
case it is ignored or retransmitted. Thus the proportion of blocks to be ignored 
after possible repeated retransmissions is a measure of the performance of the 
forward error correcting code employed on such a channel. This is one reason 
that the block error statistics is a very important group of distributions to 
be evaluated on the GCF. 


(i) Block Error Rate as a Function of Block Size 

Let us start this section by calculating the block error rate. An error 
block is defined as a block having one or more error bits. An unbiased estimate 
of block error rate or of the probability of a number of errors in a 1200 -bit 
block from the data is obtained by supposing every bit in the test run is a pos- 
sible beginning of a block. In doing this, each run is divided into consecutive 
blocks 1200 bits long starting at the i — bit of the run (i= 1,2, ... ,1200) and the 
number Nj^(k) of blocks containing k bit-errors is noted. We thus obtain, for each 
i= 1,..,,1200, a probability Pj^(k,1200) = Nj^(k)/N that a block in the subdivision 
contains k bit-errors or Pj(k,1200) as the block error rate in the subdivision. 
(N is the total number of blocks in the subdivision) . Now average over all the 


possible 1200 starting positions and take the probability P(k,1200) that k errors 

1200 

occur in a block of length 1200 to be Y'. Pi(k,1200) and P(k,1200) to 

be the block error rate. 


From Appendix IV, the probability of no errors in a block of size n is 


given by: 


P(0,n) 



(35) 
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Hence the probability of getting an error block is given by 

P(error block) = 1 - P(0, n) . (36) 

Remember that 0 < p. < 1 for j = 1, U. Hence in (35), goes down to 

J J 

zero as n becomes large. And hence P(0, n) goes down to zero for large n. 
Therefore by (36) P(error block) goes up to 1 as n gets larger. This is as 
should be expected: if the bit error rate is not zero, that is to say if it is 
possible for error to occur on the channel at all, it will occvxr eventually. So 
that any block that is almost as long as the total test run is sure to include 
the error bit. 


The empirical and predicted block error rates for n = 1200 are shown in 

Table 10 for the 4.8 kbps and in Table 11 for the 50 kbps data. For the 4.8 kbps 

data, the block error rate ranges from a low of 0.021 of 1^ during the Green 

phase of transmission to as high as 1.8% during the high bit error mode. The 

predicted values are O.OO66 for Red, 0.0021 for Amber and 0.00037 for the Green 
groups with overall value of O.OOI56 for the averaged channel. The block rates 
for the wideband data range between a low of O.OO9 of 1% and a high of 0.12 of 
1%. The predicted value for the overall channel in this case is 0.00073. 

In both cases, as shown in the tables, the predicted and empirical values 
agree very closely. 


(ii) Distribution of the Number of Errors in a Block - P(k, n] 


If a block is in error how many of its bits have been received in error? 


What is the average probability of an undetected block error for block codes? To 


answer questions like these we need to know P(k, n). 


Appendix IV shows that 


where 


n-k 

P(k, n) = P S U(X)P(k-l, n-;M) 
^0 



(37) 
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and hence, for example 


P(0, ti) - U(n) 


U 

= T> 

i=0 


n-1 

c,p. 

11 


n-1 _ 

P(l, n) = P S U(<e)p(0, n-jJ-1) 

je=o 

n-1 

= P, E U(A)U(n-/;-l) . 

je=o 

P(k,n) , for n = 1200 was evaluated for each of the 29 error runs on 
the HF circuit. Figures 8, 9, and 10 are some of the graphs of the probability 
of k or more errors in a block, P(sik,n) , k = 0,1,.. *,n, for block length of 
n = 1200 bits as given by the data and the model for each of the different 

error modes. To emphasize the effect of the wide range of the bit error rate 

-3 

(0-10 ) on the block error distribution, the predicted P(:ak,n) by the overall 

channel is plotted on each of the graphs (to same scale)* For example, in the 
Red group, of all blocks containing errors, 64*787o contain 25 or more errors 
and 36.927o have 50 errors or more while the percentages are 9.26 and 0.77 respec- 
tively for the Green group. The overall channel predicts 37*00% and 12.29% 
for ^25 and ^50 errors (see Table 13)* Tables 14 and 15 show the proportion 
of all the blocks that were correctly received compared to the expected pro- 
portion for both the HF and WB circuits* 

Some of the codes now being considered for use on the GCF can correct 
burst of errors in a block if all the errors in the burst are confined to within 
a given length apart* In other words, if the distance between extreme errors in a 
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log P (k OR MORE ERRORS IN 1200-bi^ BLOCK) 



NUMBER OF BIT ERRORS 


Fig, 8, Distribution of errors in a block (4,8 kbpsj Green group) 
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log P (kOR AAORE ERRORS IN 1200-bit BLOCK) 



NUMBER OF BIT ERRORS 

Fig, 9. Distribution of errors in a block C^.8 kbpsj Amber group) 
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log P (k OR MORE ERRORS IN 1200-bit BLOCK) 



Fig. 10. 


Distribution of errors in a block (4.8 kbpsj Red group) 
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tog P (W OR MORE ERRORS IN 1200-bi^ BLOCK) 



NUMBER OF ERRORS 


Fig, 11, Distribution of errors in a block C50 kbps, line; 
bit rate = 0,52 x 10"'"^) 
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Table 10 HF circuit (U.8 kbps) 


Block ‘Brror Rate 



P(blk Err) 

P(blk Err) 

^1 

1 

.018 

.018 

.505E-3 

2 

.00068 

.00064 

• 4535>-5 

3 

.OOOUl 

.00027 

.915E,-5 

4 

.0025 

.0025 

.507E-4 

5 

.0028 

.0028 

.2997S-4 

6 

.00057 

.00047 

.935E-5 

7 

.0021 

.0021 

.431R-4 

8 

.00048 

.00040 

.298p>5 

9 

.00095 

.000072 

.795E-7 

10 

.00024 

.00017 

.156E-5 

11 

.00043 

.00039 

.197E-4 

12 

.00013 

.000081 

.423E-5 

13 

.00018 

.00011 

.63E-5 

Ik 

.0013 

.0013 

.211E-4 

15 

.0054 

.0053 

.29E-4 

l6 

.00069 

.00069 

.17E-5 

17 

.00076 

.00077 

.129E-4 

18 

.00021 

.00016 

.445E-5 

19 

.0011 

.00097 

.978E-5 

20 

.0035 

.0032 

.915E-5 

21 

.0035 

.0035 

.22S-3 

22 

.0051 

.0046 

.21E-4 

23 

.0024 

.0025 

.l46s-3 

2k 

.0022 

.0022 

.23E-4 

25 

.0013 

.0012 

.229E-4 

26 

.0054 

.0058 

.638E-4 

27 

.0033 

.0035 

.634E-4 

28 

.0015, 

.0013 

♦353E-4 

29 

.00066 

.00054 

.14e-4 


Table H W-E circuit (50 kbps) 


Block Error Rate 



P(blk I;lrr) 

P(blk Err) 

V 

‘I 

1 

.0012 

.0012 

2 

.00054 

.00046 


.00034 

.00029 

4 

.000078 

.000078 

5 

.00012 

.000077 

6 

.0001 

.00009 

7 

.000093 

.00008 

8 

.000094 

.000077 

9 

.00011 

.00008 

10 

.00054 

,00048 

11 

.00056 

,00049 

12 

.0049 

0 

0 

Table 12 Group Block Error Rate 


Group 

P(blk Error) 



Red 

.0066 



Amber 

.0021 



Green 

.00037 



Overall 

HP 

Channel 

, 00156 



Overall 

VJ-B 

Channel 

.00073 
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Table 13 ' ( 

a). HF 

4.8 kbps data. 

Table 13 (b). W- 

B 50 kbps ‘ 

data . 

^ of error blocks containing 

io of 

error blocks containing sk 

errors 

, P(ak,n 

) data P(^k,n) 

errors, P(sk, 

n) data 


model 

estimates, for block length 

P(at 

,n) model estimates 

, for 

n = 1200 bits. 



block length 

n = 1200 

bits . 

P(s 25 ,n) P(s 25 ,n) 

P(^ 50 ,n) 

P(^ 50 ,n) 

P(^5,n) 

P(s 25 ,n) 

i P(^ 50 ,n) 

P(s 50 ,n) 

21.64 

56 .46 

l 4 .l 4 

23.96 

59.28 

61 . 46 

35.68 

36 .86 

8.34 

4.9 

7.53 

0.21 

67.60 

76.96 

44.32 

58.01 

35.78 

54.73 

23.91 

29.11 

78.49 

67-75 

56.65 

44.94 

18.47 

36.00 

12.02 

12.35 

29.78 

29.91 

?4.8o 

8.47 

21.95 

14.05 

2.62 

1.77 

50.65 

54.89 

45.34 

29.33 

21.54 

35.77 

17.24 

12.18 

93.19 

62.31 

79.22 

37.82 

22.70 

38.17 

14.02 

13.51 

84.00 

54.77 

56.06 

29.15 

9-07 

5.87 

4.27 

0.28 

65.71 

52.04 

59.84 

26.31 





68.42 

58.37 

40.56 

33.98 

8.53 

9-25 

8.44 

0.69 

71.10 

58.67 

43.91 

33.56 

68.92 

68.26 

56.53 

45.55 

88.36 

73.78 

61.18 

53.19 

52.45 

68.37 

48.53 

45.93 

64.92 

80.60 

50.26 

62.21 

55.38 

72.80 

47.94 

51.87 





29.37 

29.35 

13.05 

7.60 





3.10 

5.7 

1.87 

0.27 





11.44 

29.49 

8.84 

8.08 

Table 

L 3 (c). 

Group estimate of % 

6o.o4 

47.68 

33.80 

21.84 

of 1 

error blocks containing ^k 

13.40 

12.44 

6.23 

1.28 

errors for block length 

4.23 

0.02 

1.97 

0.003 

n = 

1200 bits 


65.05 

74.59 

51.38 

54.57 





6.24 

0.67 

3.49 

0.38 

Group 

P(s 25 ,n) 

P(> 50 ,n) 

59.45 

72.58 

39.48 

51.21 

Red 


64.78 

36.93 

12.54 

14.58 

5.01 

1.56 

Amber 


22.56 

4.71 

24.96 

33.73 

6.11 

10.51 

Green 


9.26 

0.77 

9.16 

15.60 

3.76 

2.22 

Av. 4.8 kbps 

37.00 

12.29 

25.52 

32.56 

14.49 

9.86 

Av. 50 kbps 

74.94 

52.57 

21.00 

46.37 

15.50 

20.73 
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Table 14 Empirical (P(0,n)) and 
predicted (P(0,n)) thruput 
for the H-F U.B kbps circuit, 
n = 1200. 



Group 

P(0,n) 

P(0,n) 

Group 

(P(0,n) 

1 


.982 

.982 


2 

Red 

.99654 

.9965 

.9934 

3 

(xlO‘3) 

.99764 

.99745 


h 


.99959 

.99973 


5 


.9975 

.9975 


6 


.9972 

.9972 


7 


.99943 

.99953 


8 


.9979 

.9979 


9 


.99957 

.9996 


10 


.99982 

.999894 


11 

Amber 

.99874 

.9987 


12 

(xlO-^) 

.9946 

.9947 

.99789 

13 


.99924 

.9992 


Ihr 


.9989 

.999 


15 


.99655 

.9968 


l6 


.99491 

.9954 


17 


.99782 

.9978 


18 


.99874 

.9988 


19 


.9946 

.994 


20 


.9967 

.9965 


21 


.99855 

.99867 


22 


.99934 

.99947 


23 


.99932 

.99936 


2 k 


.99952 

.9996 


25 


.99991 

.99993 


26 

Green 

.99976 

.99982 

.99963 

2? 

(xlO"^) 

.99987 

.99992 


28 


.9993 

.9993 


29 


.99979 

.99984 



Overall Channel P(0,n) = 0.99785 


Table 15 Empirical (p(0,n)) 
and predicted (p(0,n)) thruput 
for the W-B 50 kbps circuit, 
n = 1200. 


- 

P(0,n) 

P(0,n) 


.9988 

.9988 


.99946 

.99954 


.99966 

.99971 


.99992 

.99992 


.99988 

.99992 


.9999 

.99999 


.99991 

.99992 


.99991 

.99992 


.99989 

.99992 


.99946 

.99952 


.99944 

.99951 


.9951 

.996 


Overall Channel P(0,n) = 0.99927 
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burst is not too long we shall be able to correct all of those errors even with- 
out the use of feedback. This is why it is important to know! 

(iii) Dis t ribution of Distances Between Extreme Errors in a Bloch . 

Since any error correcting code will be able to correct at least one error 
especially if it is the only one in a block, we shall find this distribution for 
those error blocks vrith two or more errors . 

Let p, denote the probability of exactly k bits between extreme errors 
In s block of length n given that the block contains at least two errors. Then 
as shown in the Appendix, 

n- k- 2 

P^r(k4l) Z) U(ni)u(n-k-2-m) 

p ^ (38) 

1-P(0,n)-P(l,n) 


where r(k) - P(l, [l ) is the autocorrelation of bit errors and P(0, n) and 
P(l, n) are respectively, as fovind above, the proportions of blocks that are 
received correctly and those that contain exactly one error. 

As much as l.?5^ of all blocks transmitted may contain more than two 
errors (Table l6) (about O.U8fj in the wideband circuit) while less than 0,09 of 
1^ (O.OOhfo for WB circuit) contain exactly one error in an error block which can 
be so easily corrected. It v/ould thus be essential to use a '’burst trapping" 
code on the GCF if a high proportion of the error blocks have their bursts con- 
fined to a correctable length. The length of a burst of. errors in a block is the 
number of bits between the extreme errors in the block whatever the density of 
errors therein. 
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Examination of the HE data shows that an average of only 16. 4% of the 
error blocks in the Red group have their bursts confined to within a length of 
twenty-five bits (22% to within fifty bits) while in some runs in the Green group 
almost all the bursts are confined to within a maximal length of 25 bits (Table 16)* 
However in every one of the test runs a high percentage (Table 18) of the error 
blocks have all their errors at exactly a distance of twenty-three bits from 
the first error in the block. This is the effect of the fixed error pattern 
caused by the code built into the circuit modem. The code causes bit errors at 
18r^ and 23^ bit positions after a random bit error For example, in the Red 
group, as much as 24.2% of all the bursts in the error blocks are due to this 

effect; in the Amber group there is a run with as high as 83% while a percentage 

of as high as 98 is recorded in the Green group. The smaller the percentage of 

this fixed error pattern the better the agreement between the data and predicted 

values of the error bursts* 

A block length of 1200 bits is so long compared to the effect of the 
fixed error pattern that the two errors caused by the modem code fall, in most 
cases, within the block having the affected random error. Thus only few blocks 
should contain exactly one error which may occur either at the beginning of the 
block (within the first four bits) or at the end (within the last 18 bits)* Other- 
wise an error block would have at least two errors. This fact explains why the 
empirical probability of exactly one error in an error block, P(l,n), is so 
low (see Table 16) * 

In the wideband circuit these fixed error effects are not as pronounced 
although there are some jumps as high as 22% in the block error bursts at 
a distance of exactly twenty-eight hits from the first error in that block 
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Table 16 HF 4.8 kbps circuit 


Proportion of error blocks containing two 
or more errors and whose errors are con- 





fined to 

not more 

than (i) 

25 bits. 

(ii) 50 bits 



Group 

P{l,n) 

P(kS2,n) 

P(k^5l 

^2)% 

P(k^5t 

^2)i 

Group 

P(k^5| 

&2)fo 

P(k<50l 

P(k^50! 

>:2)i 

Group 

P(k^50[ 

S2)fo 

1 


.34e-03 

.0175 

25.4 

9.4 


28 

14.8 


2 

Red 

.42S-4 

.0034 

9.4 

11 

10.3 

18.4 

19.6 

17.2 

3 


.33E-4 

.0025 

14.4 

11.1 


20.0 

20.1 


4 


.68b-5 

.00027 

51.7 

16.0 


53 

39.1 


5 


.llB-3 

.0024 

58,8 

29.7 


78.2 

48.4 


6 


.22E-3 

.0026 

54.5 

34.7 


67.8 

50.8 


7 


.20B-4 

.00045 

43.1 

31.5 


64.5 

51.4 


8 


.8e-4 

*002 

42.8 

25.3 


61 

42.1 


9 


.61E-5 

.00038 

15.1 

11.7 


16.8 

20.0 


10 


,i4e-5 

.0001 

33 

13.3 


41.6 

24.1 


11 


.62E-4 

.0012 

27.7 

26 .6 


48.2 

42.2 


12 

Amber 

.592-3 

.004? 

60.1 

45.6 

34.7 

90.4 

68.3 

54,0 

13 


.38e~4 

.00073 

51 

31.9 


77.4 

52.5 


14 


.79s- 4 

,0009 

68.9 

39.7 


72.9 

56.8 


15 


.91E-3 

.0023 

90.3 

47.9 


90.4 

70.4 


16 


.832-3 

.0037 

74.2 

45.7 


81.00 

65.5 


17 


.i6e-3 

.002 

52.5 

20.6 


66.3 

33.7 


18 


•54e-4 

.0012 

27.9 

28.4 


4o.5 

46.4 


19 


.43E-3 

.0053 

47.9 

42.2 


60.5 

64.3 


20 


.i6e-3 

.0034 

43.4 

27.4 


62.6 

42.9 


21 


,42E-4 

.0013 

60.9 

27.3 


78.6 

47.0 


22 


. i64f/-4 

.00051 

42.2 

21.8 


59-9 

35.9 
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Table 16 Cont'd 


Group 

P(l,n) 

P(k>-2,n) 

P(kS25l 

> 2 ) 1 o 

P(ks25! 

Group 

P(k^5| 

s2 )'5g 

P(k^50| 

>2)fo 

P(k<50| 
^2 )f. 

Group 
P(k<:50| 
&2 )t 

23 

.76e-U 

.00057 

67 

51.0 


91.4 

76.0 


2 k 

,lt4E-4 

.00035 

70 

47.4 


88.4 

69.5 


25 


.00005 

99.9 

74.3 


100 

93.4 

70.2 

26 Green 


.00016 

18.8 

41.7 

48.0 

59 

60.3 


27 

.13E-5 

.00008 

3.53 

16.4 


47.0 

30.1 


28 

.23E-3 

.00046 

95.9 

54.8 


99.96 

79.6 


29 

.50S-5 

.00016 

37.3 

23.8 


55.4 

36.6 




Average P(k^5j^2) = 

46f,; 

P(k?c25l&2} 

= 24.2fo 





P(k^s50|52) = 

63. 2f,; 

P(ks50f52) 

= 38.9^ 
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Table 17 


W-B 50 kbps Circuit 


Proi-'ortion of error blocks containing two or 
more errors and v/hose errors are confined to 
not more than (i) ?5 bits, (ii) 50 bits 



P{l,n) 

?(k>?,n) 

?(k'-'25! 

P(k<4?5l 

>2)i 

Overall 

F(k:c25| 

>2)i 

P(k^50l 

P(k^50! 

>2)i 

Overall 

P(k<50| 

>2)i 

1 

.eUiv-h 

.0012 

6.4 

19 


36.5 

33 


? 

.78k- 5 

.00046 

19.2 

10.9 


22.8 

19.6 


3 

.b6K-s 

.000? 9 

U.5 

17.7 


19.5 

31.3 


h 

.38E-6 

.00007 

4.6 

28.2 


10.9 

44.0 


5 

.19s- 5 

.000f)r5 

25.1 

24.5 

8.6 

27.8 

43.0 

13.3 

6 

.8b>-6 

.0001 

3.4 

20.8 


7.1 

35.2 


7 

.20B-5 

.00008 

4.6 

24.3 


10.0 

39.5 


8 

• 21E-5 

.00007 

31.5 

27 


33.8 

46.7 


9 

.i8e-5 

.00008 

24.5 

23.6 


30.4 

39.6 


10 

• IOB-4 

.0004? 

15.7 

20.8 


22.7 

36.9 


11 

.6IB-5 

.0048 

3.1 

13.3 


10.8 

23.6 


12 

.34e-U 

,004 

7.^ 

4.0 


10.2 

6.8 
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Table 18 

HF (4,8 kbps) circuit. 
Values of k at which impulsive 
increase in P( — 1^2) occurs 


7 23 

8 23 

9 23 

10 23 

11 23 

12 23 

I23 


15 23 

16 23 

17 23 

18 23 

19 23 

20 23 

21 23 

22 23 

23 23 
2h 23 

25 23 

26 23 

27 

28 23 

29 




2b. 2 
i+.95 

9.1 
10.1 
39.^ 

44. b 

34.6 

36 

13.1 

31.2 

25.3 

49.8 

28.1 

10.2 

55.8 

83.0 

65.7 

46.7 

25.0 

35.8 

35.9 

38.8 

27.4 

56.5 
59 

98.1 

12.3 

40.4 

89.8 

17.9 

16.5 


Table 19 

W-E (50 kbps) circuit* 
Values of k at which impulsive 
increase in P( — 1^2) occurs 




k 

IV 

JV) 

1 

2 

3 

28 

l6.4 

4 

5 

28 

22.1 

6 

— 

— 

7 

8 

25 

28.5 

9 

51 

21.3 

10 

28 

12.4 

11 

— 

— 

12 

— 

-- 


Table 20 

Predicted distribution of errors 
in interleaved codes 
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log P (> k bits BETWEEN EXTREME ERRORS) 



NUMBER OF BITS k 


Distribution of distances between extreme errors 
in a block kbpsj Red group 
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log P (> k bihs BETWEEN EXTREME ERRORS) 



Fig, 13, Distribution of distances between extreme errors in 
a block (4.8 kbps; Amber group) 
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log P ( > k blU BETWEEN EXTREME ERRORS) 



Fig, 14 • Distribution of distances between extreme errors in a 
block (4.8 kbps; Green group) 
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log P (> k bus BETWEEN EXTREME ERRORS) 



Fig, 15, Distribution of distances between extreme errors in a 
block (50 kbps; bit rate = 0,52 x 10”^) 
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(Table 19), Which explains why in this case the data and predicted values are 
much closer (Table 17); 

In a seperate paper to deal with comparison of code performances on the 
GCF, a strategy will be developed to correct for these fixed error patterns 
before assessing the capability of any code on the channel. 

A rather simple way to correct burst noise is to interleave the coded 
blocks to some depth t, say. Here the bits of each block are not transmitted 
consecutively but rather the bits of every t coded blocks are interspersed in 
such a way that the once consecutive bits are transmitted separated exactly t 
bit positions apart. Iifhen re-ordered at the receiver the blocks appear to have 
been corrupted by random errors, for sufficiently large depth t. It is interest- 
ing to observe the effect of interleaving on the distribution of block errors. 

The random error effect is to spread out the bit errors among many more blocks 
than woiild otherwise have been affected by the action of the bursty channel 
alone on the un- interleaved blocks with the result that the namber, and hence 
the proportion of error-free blocks after the interleaved blocks are commuted 
together again at the receiver, is lower than without interleaving. But the 
error blocks nov/ contain fewer errors. Thus if a code that can correct up to, 
say, r errors in a block is interleaved appropriately, many more blocks than 
otherwise would now be decadable. Hence a measiire of performance of interleaving 
is the percentage increase in the proportion of decadable blocks. This is the 
main idea behind interleaving. 

Suppose we interleave a block code of length W to depth t. That is 
each of the blocks is divided into t sub-blocks as shovrn in (39)* 
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» • 


1 t+1 Ct+1 


1111 ' • * 11 
2 t+2 ct+2 



1 


(39) 


^t'^2t 




Each sub-block is seperately encoded, and decoded before they are finally commuted 
together again. Let X,, X^, . . . , X^. be the number of errors in sub-block j, 

0 = 1, 2, t. By the time independence of our model (which in practical 

terms implies that the channel is as likely to introduce errors say in the first 

sub-block as it is in any other) it is not difficult to see that the X.’s are 
identically distributed. But they are not independent (because of the burst 
phenomenon). Now suppose the interleaved code can correct up to r errors in 
each of the t sub- blocks. Then we shall be able to correct up to a total of 
rt errors in the whole block of length N if no one of the component t sub- 
blocks contains more than r errors. Thus we should find the probability that 

(x^ 5 r ^ Xp ^ ^ > ♦ • • ^ ^ rl . 

First we find the distribution of the X’s denoted by P^(k,n). 

(iv) Distribution of Errors in a Code Interleaved to Some Depth C 
It is shown in the Appendix that 

n-k 

p*(k,n) - P*(l) S U^(£,)P (k-1, (hO) 

A'O 


where 
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n- jf^-k 

U (k) - P (0^[l) 

^ ^ k 

V^(k) - P^(0 l|l) 

P. (1) = bit rate 
t 

N 

all calculated using the t-step transition probabilities. Here n = -r is the 
length in bits of each of the t sub-blocks. 

Let us interleave each block to depth t - 6 so that n = 200 bits. In 
this case for the Red group the thruput P^(0>u) is as high as 0.99386 with 
only 0.^5 of 1^ containing fo'or or more errors. In the good mode (Green group) 
the thruput is 0.9997 with only 0.00005 containing four or more errors. See 
Table 20 for the complete numbers. 

As we remarked above the important distribution for evaluating performance 
of interleaving is the probability of the joint events (x^ r, £ r, ..., 

X^ s r] for the error correcting capacity r of the code employed. Details of 
this and some other strategies of burst correction will be given in a seperate 
paper . 
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Section VI 


BLOCK (SYMBOL) ERROR DISTOIBUTION 

There are some error correcting algorithms in which a block is considered 
as being made up of symbols, each symbol is a fixed number of bits. Such an 
algorithm is designed, for example, to correct up to a given number of symbol 
errors in a block* Because the errors on the GCF occur in clusters therefore, 
it may he more efficient to employ this type of algorithm rather than use one 
that is designed to correct only bit errors* Another reason for looking at 
symbols instead of the individual bits will be demonstrated presently when we 
consider the problem of acquisition and maintenance of synchronization on the 
GCF* 

But even without consideration of forward error correction it is almost 
clear that dividing a block on the GCF into symbols of appropriate length is a 
more efficient way to take advantage of the burst noise for feedback and re- 
transmission. For in doing so, we shall need only ask for and retransmit the 
symbols within each block that are received in error instead of having to re- 
transmit the whole block. Retransmitting only the error symbols will particu- 
larly be preferred in cases when, although the number of bit errors in the block 
is higher than the error correcting capability of the code employed, the errors 
are all confined to within only a few symbols. 

For a symbol of length s let us first find the statistics we shall 
employ to estimate the performances of the different algorithms designed for^ 
correcting symbol errors. Then we shall treat the problem of acquisition and 
maintenance of synchronization on the GCF. Specifically we shall look at the 
following statistics: 
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(i) P^(k,n): the distribution of error symbols in n-symbol word 

p^(o'^) = probability of error-free n-symbol word 
s Ic i 

(ii) P (0 1 1) j the probability of k error-free symbols following a 

given error symbol, 

p®(0^1 1 1) = the symbol gap distribution 

(iii) R (k) = p(symbol k in error | ini tial error symbol) or the corre- 

lation of symbol errors* 

In the sequel a symbol is considered to be in error if at least one of 
its bits is in error* All the above expressions are derived in the Appendix V. 
(i) Distribution of Error Symbols in n-Symbol Word 

This is given by* 

n-k 

P®(k,n) = P® L P®(0^|1) P®(k-1, n-jJ-l) (Al) 

i=0 


where 


T®(k-l,n-i^-l) 


n-k-ji 

z 



(ii) 


is ; 


Symbol Gap Distribution 

The probability of k error-free symbols following a given error symbol 


P®(0^|l) = 


S c.p 

1 


i 


d-p") 

?;ti^ 


s 




c,(l-pf) 

p® _ p V' i ^ 

'^1 - h ^ p\(i-p^) 
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s 2 

_ sjj (1-Pi) 

^ 'A 

^Pirr^rr 


s 

Pj^ is the symbol error rate and Pj^ is the bit rate. For s = 6, 8 or 10 in 
a block of length 1200 bits, P^(k, n) was found for the Red, Amber and Green 
groups and the overall H-F and W-B channels ( Table 21). If we use a code that 
corrects up to two symbols say, the table shows the proportion of blocks that 
would contain more symbol errors than the capability of the code and may have 
to be retransmitted. For exajmple, for symbol length s = 6 bits in the Red 
(10 bit rate) group about 0.6^ of the blocks would be in this category. To 
achieve the same error rate we would have to be able to correct up to 6 bit 
errors in the 1200-bit block if we use a BCH code and we would need to use 
about 66 parity check bits to do it. At present only 33 bits are allowed for 
error detection and correction capability on the GCF. So that even if we use 

all the 33 bits for error correction alone we cannot correct more than 3 bit 

<■ 

errors in the 1200-bit block. For the same symbol error correction capability 
the longer the symbol length, of course, the less the proportion of blocks left 
uncorrected and the longer the parity check bits of the BCH code that will give 
the same error probability (see Table 21). 

As said in a number of places already our intention in this report is not 
to present detailed study of appropriate coding strategies (including feedback) 
for the GCF. This we intend to do in a separate paper. The above trade-off was 
mentioned only to emphasize the importance (and efficiency) of symbol error 
correction on the GCF. 


JPL Technical Memorandum 33-699 


87 



JPL Technical Memorandum 33-699 


CO 

CO 


Table 21. Syni)ol error rate (Group Statistics), The proportion having at least three 
symbol errors and the number of bit errors that give the same proportion in 
a 1200-bit block considered as made up of symbols of length s =6, 8 or 10 bits. 



P®( 0 ,n) 

s=6; n=^200 
P® P®(^ 3 ,n) 

Number of 
bit errors k 
giving equi- 
valent value 

of P®(s 3 ,n); 

& (p(ak,1200)) 

s=8; 11=150 
P® P^(^ 3 ,n) 

Number of 
bit errors k 

giving equi- 
valent value 
of P®(^ 3 ,n); 

& (P(sk, 1200 )) 

s=10; n=120 
P^ P®( 2 = 3 ,n) 

Number of 
bit errors k 
giving equi- 
valent value 
of P®(a3,n); 

& P (&k, 1200 )) 

Red 

.9935 

.00066 

.0061 

6 

( .0061) 

.00073 

.00595 

7 

(.00596) 

.000779 

.00586 

8 

(.00586) 

Amber 

.99788 

.000084 

.00162 

5 

( .00165) 

. 932 (- 4 ) .00153 

6 

(.00155) 

.00010 

.00145 

7 

(. 00146 ) 

Green 

.999624 

.966(-5) .00024 

5 

(.00025) 

•ll{- 4 ) 

.00022 

6 

( .00023) 

, 12 (- 4 ) .00021 

7 

(. 00020 ) 

Av* 4.8 

.9978 

.000122 

.00183 

5 

(.00183) 

.000135 

.00176 

6 

(.00176) 

.000145 

.0014 

7 

( . 0017 ) 

Av. 50 

.99932 

. 976 (- 4 ) .000644 

12 

(. 000642 ) 

.00011 

.000637 

13 

(.000635) 

.00012 

.000631 

14 

( .000628) 



50 


NUMBER OF ERRORS k ► 

Fig, 16, Distribution of symbol errors C^veraged 4 
channel; symbol length - 6 bits) 


c_ - 


8 kbps 
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NUMBER OF ERRORS k 


Fig, 17, Distribution of symbol errors Coverall 50 kbps channel; 
S5mibol length = 6 bits) 
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log P (>k SYMBOL ERRORS) 



Fig, 18, Distribution of symbol errors C^veraged 4.8 kbps; 
symbol length = 8 bits) 
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NUMBER OF ERRORS k 


Fig, 19, Distribution of symbol errors (overall 50 kbps; 
symbol length = 8 bits) 
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)09 P ( > J< SYMBOL ERRORS) 



Fig, 20, Distribution of s3mibol errors (averaged 4,8 kbps; 
symbol length = 10 bits) 
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bg P ( > k SYMBOL ERRORS) 



Fig, 21, Distribution of symbol errors (overall 50 kbpsj 
symbol length = 10 bits) 


94 


JPL Technical Memorandum 33^699 



This section will not be complete without a word about the autocorre- 


lation of symbol errors denoted by r^(k)j k = 0,1,2,.*., i.e., r®(k) = P(symbol 
k in error I initial symbol error)* For if the symbol errors are too highly 
correlated then we may not be able successfully to shorten the buffers at the 
transmitter and receiver since we may be forced to store for retransmission 
many blocks that are in error because occurrence of an error symbol may cause 
a high number of others to occur in quick succession* On the other hand if the 
capability or the code employed for forward error correction can handle most of 
the bursts when they occur it would be desirable to have high values of r®(k) for 
small values of k, i.e., just as in bit autocorrelation we would want to have not 
much longer bursts of symbol errors than the capability of the symbol correct- 
ing code so as not to have a problem of buffer over-flow for a buffer of 
'^moderate^^ size* More detailed consideration of this problem will be studied 
in another paper* 

In Appendix V it is shown that (iii) is given by: 



where 

C •= (Cj^j ^ 2 * ^ 3 * 

: transition matrix (Fig. 2) 

^4x4 * transitions between the good states only 

(obtained by deleting the last row and column of M) 
S = M - (last column of M) 

U = column vector (4 x 1) of I's 

and 
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0 


Q * 


/ 


V 


1-p! 


s-1 


1-p 


1-p 


s-1 


1-p, 


1-p: 


s-1 


1-p, 


l-pj‘^ 


1 - 




For s = 6, Table 22 displays values of r^(k) for k = 3, 200. It is seen 
that the highest correlation exists in the Red (xlO error rate) group. In 
this case it is more probable (0.57) for a symbol error to be correlated with 
another three symbols away. This correlation reduces to less than 0.08 when 
k = 200 the symbol error probability is 0.00066. In designing a feed-back and 
retransmission strategy for the GCF note should be taken of the high symbol 
error correlation in each of the error groups for k = 3 (and indeed for all 
moderately small values of k) and of what we shall call a burst of block 
errors in the next section* It is only in the Green (xlO bit rate) group that 
r (1200) - 0.00001 is closest to the symbol error probability (0.0000097) show- 
ing as in the case of bit correlation that symbol memory is almost 1200. We may 
further observe that as symbol length becomes longer (for example s - 10 in 
Table 22) the correlation is weaker for small values of k. That is to say that 
in a great number of bursts of errors on the GCF the longer the symbol lengths 
the more likely it is that all the errors are contained within (i.e# affects) 
only a single symbol. Obvious fact*. But no such general statement can be made 
for large values of k except in the case of the Red error group where an error 
symbol of length s = 10 bits is less likely to effect an error in another 
symbol at distance k = 200 away (than in the case when the symbol length is 
6 bits) . 
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Table 22. Group symbol-error correlation. The 

probability starting with initial symbol 
error (with probability P^) of getting 
another symbol error in position k = 3 
or 200 for symbol lengths s = 6, 10. 



r^(k) for s = 6 

. ■ 

r®{k) for s 

= 10 

k 

3 

200 


3 

200 


Red 

.5722 

.075 

.00066 

.554 

.069 

.000779 

Ttober 

.4841 

.0028 

.000084 

.456 

.0032 

.00010 

Green 

.4031 

.00001 

•966(-5) 

.358 

.000012 

.12(-4) 

Av. 4.8 

.5304 

.02 

.000122 

.51 

.0155 

.000145 

Av. 50 

.5643 

.1054 

.9 6(-4) 

.61 

.087 

.00012 
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NUMBER OF SYMBOLS k 


Fig, 22, Auto*-Correlation of symbol errors C^veraged 4.8 kbps; 
S 3 ^bol length = 6 bits) 
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NUMBER OF SYMBOLS k 


Fig. 23* Auto-correlation of symbol errors (overall 50 kbps 
symbol length == 6 bits 
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iv. Sync Acquisition and Maintenance Probabilities 

We are now ready to consider in some detail the problem of sync acquisi- 
tion and maintenance on the GCF. Of course it is understood that all we can do 
here is to exhibit the probabilities of each of a number of strategies that are 
now being proposed for reacquiring synchronization once it is lost and for 
maintaining it after it is acquired. The hardware problems are beyond the scope 
of this work'. 

Specifically we intend to compare the performances of two strategies both 
based on using a prefix sequence of 2k bits in each of the 1200-bit blocks. 

1. The first strategy proposes to accept sync if there are not more than 3-bit 
errors in the prefix sequence. 

2. The second accepts sync if there is at most only one error symbol in the 
24- bit prefix considered as four 6- bit symbols . 

Our criterion of comparison shall be the efficiency of each of the 
algorithms in reacquiring a lost sync within a frame of 1200 bits after it is 
lost and of maintaining it once it is reacquired. Then we shall explore ways 
of improving the algorithms. 

Certainly if the 24-bit prefix of a block agrees exactly with the sync 
sequence we would have no doubt but that we have acquired synchronization. It is 
when this agreement is not perfect that we are forced to make a decision as to 
whether the errors causing the disagreement are due only to the channel noise or 
to the fact that the 24-bit sequence was not originally the sync sequence. At 
the extremely high bit rate which will be in use it is not unacceptable to search 
every 24-bit segment of the incoming data until one of the segments "looks 
enough" like the sync sequence that we are reasonably sure that we have located 
the start of a new block. How many errors in a 24-bit segment should we attribute 
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bit positions. And if we are told that the sync sequence contains some errors 
all we can really assume is that any one of its bits could be the one in error . 
So that each bit is as likely to be in error as any other bit and then it is as 
likely to be in error as it is not. Thus we see that the probability of false 
detection does not depend on the error rate of the channel. Hence to compute 
this probability we can treat the prefix 24-bit sequence as composed of 
independent equally likely bits. 

We draw several conclusions from our tables, 

1. Table 23(a); For n = 24, even in the worst (Red) error mode, 

the algorithm for acquiring sync, once it has been lost, which looks 
for a 24-bit sequence that looks like the sync sequence up to only 
1 bit error would fail to detect sync less than 0.1% of the time within 
one block after the loss of sync. And in only about 0.l8% of the time 
will this algorithm lock onto the wrong synchronization. However, the 
algorithm that allows up to 3 errors in the sync sequence will lock 
onto the wrong synchronization in over l6% of the time although it will 
hardly fail to identify the true sync sequence. Table 23(b) tells the 
story for the wide-band 50 kbps circuit. 

2. On the other hand the algorithm which looks at the 24-bit sync sequence 
as four 6-bit symbols and locks onto synchronization if not more than one 
symbol is in error will lock onto the wrong sync in less than 2% of the 
time and is equally as efficient as the first algorithm in not failing to 
identify true synchronization. This algorithm is equally strong for the 
wide-band channel (see Tables 24(a) and (b)). 

We can explain our conclusions in 1 and 2 by the fact that we are dealing 
with a burst noise channel which is very good indeed during the good modes (see 
the high thruput in each case analyzed - Tables 23-24), When the errors do 
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to the action of the channel noise and still be well protected against locking 
onto wrong line sync? In other words how well must a segment look enough like 
the sync sequence? For if we refuse to lock onto sync any time the 2k bit 
segment tested disagrees with the sync sequence we may miss acquiring the sync 
because of the noise in the channel. We call such a mistake FAILURE TO DETECT 
sync. On the other hand if our algorithm is not stringent enough, in a frame 
of 1200 bits a 2U-bit segment which is not the sync sequence might be accepted 
as one. This mistake is referred to as FALSE DETECTION of sync. 

Probability of failure to detect sync (Tables 23-26} is given, in the 
case of the first algorithm, by 

Pr(failure to detect using the bit count) = Z) P(k,n) (^3) 

k>t 

where P(k,n) computed in ( 37 ) is the probability of getting k bit errors 
in an n-bit sync sequence and k^ is the detection level for the algorithm. 

For the second algorithm this probability is given by: 

Pr(failure to detect using algorithm 2) = S P®(k,n) (hh) 

k>t 

where P (k,n) given in (^H) is the probability of k symbol errors in an n- 
symbol sync sequence. We present these probabilities for each algorithm and each 
test run, each of the Red, Amber and Green error modes and the averaged high- 
frequency end wide-band channels. To compute the probability of false detection 
we use the fact that if our algorithm allows up to k errors in the sync 
sequence then there are times we acquire sync that we are really choosing any one 
of the other sequences of 0*s and I’s that differ from the sync sequence in ^ t 
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occur however they do so in bunches. The second algorithm uses this fact by 
allowing up to five errors provided they all occur in only one synobol. 

3. We increase the length of the sync sequence to 30 and 36 bits to see 
when the first algorithm can be efficient (see Tables 25 and 26 ). At 

n = 30 bits we can very safely allow up to 3 errors in the syne sequence. 
This procedure will lock onto the wrong sync within one block of sync 
loss in less than 0.5^^ of the time while it provides ample protection 
against failure to identify the right sync sequence. Still for n - 30, 
even allowing up to four errors will lead to wrong identification in 
less than of the time compared to l6^ for allowing just one error at 
n = 24. See the rest of Table 24 for the results for the Amber, Green 
and the averaged channels . As expected the efficiency of the first 
algorithm improves with longer sync sequence. Look at the case for 
n * 36 . 

4. In the case when the sync sequence is taken to be 5 symbols (30 bits) 
instead of four it is hardly possible to lock onto the wrong synchroniza- 
tion with the algorithm that allows up to one symbol error. Within a 
block of loss of sync this algorithm will not fail to identify the sync 
sequence if there is one and will lock onto the wrong sync with about the 
same probability ( <0.0009). But to allow up to 2 symbol errors will detect 
the wrong sync in about 5^ of the time (see Table 26). 

We designate by THRUHfr the probability that the sync sequence passes 
through error- free emd it is shown in Tables 2 3-24. Then the probability that 
the sync is maintained once it is acquired which depends on the criterion for 
sync acquisition being used is the thri;^ut plus the probability of getting ^ t 
errors in the sync sequence. This probability is very high indeed for each of 
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the algorithms examined. Thus we have based our comparison only on their 
strength of reacquiring synchronization within a frame of 1200 bits after it is 
lost. 

(v) Conclusion 

For a sync sequence of length n = 24 bits the algorithm that allows not 
more than one symbol error is preferred to the one that allows up to three bit 
errors . By looking at symbols instead of individual bits we can reduce the 
probability of false detection of sync sequence from l6% to less than 2^ with 
correspondingly lower probability of ever failing to lock onto the right sync. 

We are even much better protected against ever making a wrong decision 
if we allow the sync sequence to be up to 5 symbols long. In this case the 
probability of ever making either type of error is less than 0.0009. To achieve 
this reliability we cannot allow more than two bit errors in the 30-bit sync 
sequence if we employ the bit count as our criterion of reacquiring synchroniza- 
tion once it is lost. 

It is therefore suggested that we look into the possibility of using a 
sync sequence of 5 symbols so as to take advantage of the efficiency of the second 
algorithm. 
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Table 23(a). Probabilities of failure to detect and of false detection 
of sync for algorithm which looks at 2U-bit prefix and 
allows errors. (H-F ^.8 kbps) 



. 1 

k 1 

Detection Level 
2 3 

4 

Prefix 

Thruput 

Prob. of 
false detect 

.00175 

.021 

.163 

.9078 

i 

i 

— 

Red 

.00093 

.00083 

.00072 

.00062 

.99896 



Amber 

.00012 

.00010 

.87E-4 

.71E-4 

.999858 



Green 

.15E-4 

.12E-4 

.96E-5 

.74 e -5 

.999981 



Av. 48 

.00017 

.00015 

.00013 

.00011 

.999806 



1 

.00229 

.00189 

.00152 

.00118 

.99728 



2 

.000024 

.000019 

.000014 

.0000099 

.999971 



3 

.000026 

.000025 

.000023 

.000022 

.999973 



4 

.00018 

.00016 

.00014 

.00012 

.999802 



5 

.00015 

.000114 

.000087 

.00006 

.999816 



6 

.000032 

.000029 

.000026 

.000023 

.999965 



7 

.00016 

.00015 

.00013 

.00012 

.99982 



8 

.000016 

.000012 

.000009 

.000007 

.999981 



9 

.44 e -6 

.88 e -7 

.i 4 e -7 

.19E-8 

.999998 



10 

.77E-5 

.6IB-5 

.47E-5 

•34 e -5 

.999990 



11 

.66E-4 

.6 ib -4 

.55E-4 

.5OE-4 

.999928 



12 

.ioe -4 

.ioe -4 

.98E-5 

•95E-5 

.999989 



13 - 

.i 6 e -4 

.15E-4 

.15E-4 

.i 4 e -4 

.999983 

PROB. 

14 

.92E-4 

.78 e -4 

.65E-4 

.52E-4 

.999894 

OF 

15 

.21E-3 

.17E-3 

.12E-3 

.82E-4 

.999734 

failure 

16 

.HE- 4 

.39E-5 

.12E-5 

.28E-6 

.999975 

ivJ 

DETECT 

17 

.49E-4 

.45E-4 

.4 oe -4 

•35E-4 

.999945 



18 

.i 4 e -4 

.13E-4 

.12E-4 

.ioe -4 

.999984 



19 

.45E-4 

.36 e -4 

.28 e -4 

.22E-4 

.999944 



20 

.6 oe -4 

.23E-4 

.73E-5 

•19E-5 

.999874 



21 

.62E-3 

.6OE-3 

.56E-3 

.53E-3 

.999347 



22 

.i 4 e -3 

.80E-4 

.4 ie -4 

.18 e -4 

.999789 



23 

.00044 

.00041 

.00039 

.00036 ■ 

.999542 



24 

.00015 

.0001 

.00006 

.000029 

.999795 



25 

.87E-4 

.79E-4 

.70S-4 

.6IE-4 1 

.999903 



26 

.00028 

.00023 

.00019 

.00016 , 

.999679 



27 

.00025 

.00022 

.00018 

.00015 ' 

.999707 



28 

.00010 

.98E-4 

.92E-4 

.87E-4 

.99989 


J 

29 

.4gE-4 

.45E-4 

.4 oe -4 

.36 e -4 I 

.999946 
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Table 23(b). Probabilities of failure to detect and of false detection 
of sync for algorithm which looks at 24-bit prefix and 
allows ^terrors (W-B 50 kbps) 





1 

Detection Level 

2 3 

4 

Prefix 

Thruput 


Prob. 

false 

of 

detect 

.00175 

.021 

.163 

.9078 

i 

i 

Av. 50 kbps 

.00015 

.00013 

.00011 

•96e-4 

.99983 



1 

.00013 

.00012 

.00012 

.00011 

.999862 



2 

.81E-4 

.70E-4 

.Y4e-4 

.70E-4 

.999916 



3 

.35E-4 

.33E-4 

.32E-4 

.30E-4 

.999964 



4 

.54e-5 

.45E-5 

•37E-5 

.3OE-5 

.999993 

PROS. 

5 

.67E-5 

•65E-5 

.62E-5 

.59E-5 

.999993 

OF 

6 






FAILURE 







TO 

DETECT 

7 

.70E-5 

.66e-5 

.6IE-5 

.57E-5 

.999992 



8 

.6ie-5 

.58E-5 

.56E-5 

•53E-5 

.999993 



9 

.73B-5 

.69E-5 

.65E-5 

.62E-5 

.999992 



10 

.49E-4 

.47E-4 

.44e-4 

.42E-4 

.99995 



11 

.71B-4 

•69E-4 

.66e-4 

.64b- 4 

.99993 

} 


12 

.0011 

.00096 

.00078 

.00059 

.998706 
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Table 24(a). Probabilities of failure to detect and of false detection of 

sync for algorithm which looks at a prefix of four 6-bit symbols 
and allows St errors (H-F 4.8 kbps) 


Detection Level 

k 1 2 3 

Prefix 

Thruput 

Prob. of 
false detect 

.0177 

1 

1 

i 

i 

Red 

•00079 

.000546 

.00029 

.998974 



Amber 

.0001 

.63E-4 

.29B-4 

.999858 



Green 

.12E-4 

.96e-4 

.26E-5 

.999982 



Av. 4.8 kbps 

.00015 

.96e-4 

.47E-4 

.999803 



1 

.00186 

.0011 

.44b-3 

.997276 



2 

.193B-4 

.12E-4 

.44e-5 

.999972 



3 

.27E-4 

.2270-4 

.16b-4 

.999968 



4 

.00016 

•00011 

.623E-4 

.999798 



5 

.00011 

•586e-U 

.225B-4 

.999814 



6 

.31B-4 

.225B-4 

.13E-4 

.999961 



7 

.00015 

.00012 

.68b-4 

.999821 



8 

.13B-4 

.791B-5 

.30E-5 

.999981 

PROB* 

9 

xio”^ 

xlO"*^ 

xio"® 

.999997 

OF 

10 

.74e-5 

.40E-5 

.15E-5 

.999988 

FAILURE 

11 

.49E-4 

.38e-4 

.24e-4 

.99994 

TO 


.75B-5 

.69E-5 

.58b-5 

.999992 

DETECT 

13 

.19E-4 

.17E-4 

.14e-4 

.999979 



14 

.74b- 4 

.48e-4 

.22E-4 

.9999 



15 

.00018 

.00011 

.38E-4 

.99974 



16 

.85B-5 

.19E-5 

.i8e-6 

.999976 



17 

.4oe-4 

.32B-4 

.i8b-4 

.99995 



18 

.12B-4 

.834E-5 

.53B-5 

.99997 



19 

.34e-4 

.18b-4 

,68e-5 

.999943 



20 

.46E-4 

.UB-4 

-llB-5 

.999875 



21 

.00059 

.00051 

.00038 

.99933 



22 

.00011 

.45E-4 

.88E-5q 

.999786 



23 

.00040 

.00033 

.00023 

.99955 



24 

.00013 

.59B-4 

.i4e-4 

.999798 



25 

.784B-4 

.6ie>-4 

.350-4 

.99999 



26 

.00022 

.00014 

.63E 4 

.99968 



27 

.00021 

.00013 

.650-4 

.99971 



28 

.96e-4 

•82e— 4 

.624e-4 

.99989 


f 

29 

.44b-4 

.351E-4 

•22B-4 

.999946 
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Table 24(b). Probabilities of failure to detect and false 

detection of sync for algorithm which looks at a 
prefix of four 6-bit symbols and allows fit errors 
(W-B 50 kbps ) 





Detection Level 






: 1 

2 

3 



Prob, of 




prefix 


false detect 

.0177 

1 

1 

Thruput 

i 

i 

Av. 50 Kbps 

.00012 

.79E-4 

•36e-4 

.999845 



1 

.00011 

-95B-4 

•72E-4 

.9998686 



2 

.812B-4 

.69E-4 

.52E-4 

.999908 



3 

.35E-4 

.292E-4 

.22E-4 

.9999606 



U 

.llB-5 

,6 iie -6 

.26 e -6 

.999997 

PROB* 






OF 

5 

.89E-5 

.782E-5 

.63E-5 

.9999899 

FAILURE 

TO 

DETECT 

6 

.39E-5 

.33E-5 

.261B-5 

.9999955 



7 

.47E05 

.37E-5 

.27E-5 

.999994 



8 

.79B-5 

.69E-5 

.55E-5 

.999991 



9 

.64e-5 

•53E-5 

.42E-5 

.999993 



10 

.48e-4 

.4ie-4 

•30E-4 

.999945 



11 

.55E-4 

.49E-4 

.4oe-4 

.999939 

1 

r 

12 

.001 

.00072 

.00031 

.99867 
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Table 25 


Probabilities of failure to detect and of false detection 
of sync for algorithm which looks at a prefix of length 
n = 24 , 30 or 36 bits and allows Sk errors. 


n 

k 

Prob . of . 

False 

Detect 

Red 

Prob. 

Amber 

of Failure to 
Green 

Detect 
Av. 4.8 

Av. 50 

24 

mn 

.00175 

.00093 

.00012 

.15E-4 

.00017 

.00015 



.021 

.00083 

.00010 

.12E-4 

.00015 

.00013 



.163 

.00072 

.87E-4 

.96E-5 

.00013 

.00011 



.9078 

.00062 

.71E-4 

.74e-5 

.00011 

■96e-4 



.9078 

.00062 

.71E-4 

•74e-5 

.00011 

.96E-4 

30 


.00051 

.00092 

.00012 

.l4s-4 

.00017 

.00015 



.0049 

.00082 

.00010 

.12E-4 

.00015 

.00013 



.0348 

.00073 

.86e-4 

♦93E-5 

.00013 

.00012 



.1901 

.00063 

.71E-4 

•73E-5 

.00011 

.99E-4 



.8371 

.00054 

.58E-4 

.56E-5 

.9E-4 

.82E-4 



1 

.00045 

.46e-4 

.42E-5 

.73E-4 

.66e-4 

36 i 

— 1 

3 

1 

.00013 

.00091 

.00011 

.13E-4 

,00016 

.00015 


4 

.0011 

.00082 

.99E-4 

.HE- 4 

.00014 

.00013 


5 

.0075 

.00073 

.85E-4 

.89E-5 

.00013 

.00012 


6 

.04l 

.00064 

.71E-4 

•72E-5 

.00011 

.00010 


7 

.182 

.00055 

.59E-4 

.56E-5 

. 92E-4 

.86e-4 


8 

.694 

.0004? 

,48b-4 

•43E-5 

•77E-4 

,71E-4 


9 

1 

.00039 

.38e-4 

•32E-5 

.62E-4 

.57E-4 
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Table 26 . Probabilities of failure to detect and of false detection 
of sync for algorithm which looks at a prefix of n 6 - bit 
symbols (n = 4, 5 , 6) and allows errors. 


n 

k 

Prob. of 

false 

Detect 

Red 

Prob. 

Anber 

of Failure 
Green 

to Detect 
Av. 4.8 

Av. 50 

■ 


.0177 

.00079 

.0001 

.12E-4 

.00015 

.00012 

■ 

2 

1 

.000546 

.63E-4 

.64e-5 

.96E“4 

.79E-4 

5 

1 

.00034 

.00089 

.00012 

.14e-4 

.00017 

.00014 


2 

.044 

.00066 

.795>*4 

.84e-5 

.00012 

.0001 


3 

1 

.00044 

.47E-4 

.45E-5 

.75E-4 

.62E-4 

6 

B 

.64e-5 

.00097 

.00013 

.15E-4 

.00018 

.00015 


H 

.0010 

.00076 

.93E-4 

.IOB-4 

.00014 

.00012 


H 

.086 

.00056 

.62E-4 

. 6 IE -5 

.97E-4 

.83E-4 


H 

1 

.00036 

.35E-4 

.3IE-5 

.59B-4 

.48e-4 
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Section VII 


BURST DISTRIBUTION 

From what has been said up to now it is clear that the single most im- 
portant distribution on the GCF is the Burst Distribution. Indeed it is precisely 
because o£ this reason that attempts are being made to design forward and feed- 
back error correcting strategies that have the capability of removing these 
bursts of errors for improved communication. The fact is that absolutely every 
step must be taken to understand the nature of these bursts: how long they 
are, how dense the errors within them are, and particularly how many standard 
1200-bit blocks are affected each time the channel enters into this bursty mode. 
These and related questions will be answered in this section. 

But before we start, let us try to fix ideas of what exactly we shall 
refer to as a burst of errors. We define a burst as a sequence of bits (a) be- 
ginning and ending with an error, (b) separated from the nearest preceding and 
following error by a gap of no less than some number, say G, called the guard- 
space, and (c) containing within it no gap equal to or greater than the guardspace. 
Immediately a number of questions spring to mindo For example, how large 
must a guardspace be? Or for a given set of data what is the criterion for 
choosing an optimum value of G? The fact is that since optimality of G can be 
defined only with respect to a set of criteria which in turn are based on what 
we consider important, a value of G which is optimal in one sense may necessar- 
ily not be optimal in some other. We shall illustrate this point with two 
examples : 

1. Following Stern’s intuitive reasoning in [15] the optimum G should sep- 

arate the data into bursts with a density of errors which is much higher 
than the "background" error rate. The background error rate is used here 
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to ref<^r to the error rate which would result If each sequence of bits 
defin(?d as a burst were called and replaced with a single bit error. 

The proportion of bursts in a test, the density of errors in a burst 
and the distribution of burst lengths all should not be too sensitive 
to the Ct that is optimum. By considering the variation of the back- 
ground error rate with G alone it is possible to find a range of G 
values that leaves the background error rate constant. Take as optimum 
G any value in this raqge. 

W’e are primarily interested in choosing as guardspace that G that will 
divide the data into bursts a high proportion of which are less than the 
burst correcting capability of the code employed on the channel. To be 
more precise let b be the burst correcting capability of a code C. 

By this we mean relative to the guardspace G, b is the largest integer 
for which every noise sequence containing only bursts of length b or 
less is correctly decoded. See Gallager [l 6 ]. So In this case we will be 
interested in choosing a G for which b is maximal for the code C. If 
for example, the capability of the code C is as high as G i.e., 
b = then using the relation 

G ^ Ij^ 

b ■ 1 -R 

in fl61, which connects the rate R, G and B, we see that the rate R, 
cannot be more than 0 , 5 . Which is too low of course. To achieve a rate 
R of 0 . 9 , b cannot be more than g/ 19. That is, the error correcting 
capability of a code with rate R = 0.9 cannot exceed b = G/ 19 . 
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Table 27(a). Optimal guardspace, 4800 bps data 





G = 3600; 

b = 1200 



G = 48 Q 0 ; 

b = 1600 



^ of 
burst 

# 

s b 

# 

> b 

max, 

burst 

^ Of 
errors 
in max, 
burst 

# of 
burst 

< b 


max. 

burst 

4 of 
errors 
in max. 
burst 

# of 
burst 

# 

<: b 

# 

>b 

max. 

burst 

^ of 
errors 
in max, 
burst 

1 

322 

151 

171 

6133 

l 4 l 

100 

79 

21 

217362 

3550 


60 

25 

217362 

3550 

p 

li 

10 

1 

150 

63 

11 

11 

0 

150 

63 


11 

0 

150 

63 

3 

8 

5 

3 

535 

246 

7 

7 

0 

1158 

9 

7 

7 

0 

1158 

9 

h 

93 

79 

14 

1077 

126 

89 

87 

2 

6 l 64 

477 

86 

81 

5 

616U 

477 

5 

122 

90 

32 

654 

27 

109 

103 

6 

7433 

14 ? 

109 

103 

6 

7433 

147 

6 

23 

18 

5 

565 

272 

20 

20 

0 

884 

ik 

19 

18 

1 

4780 

32 

7 

109 

67 

22 

861 

299 

76 

62 

ih 

4787 

15 

66 

47 

19 

12866 

309 

8 

22 

21 

1 

148 

66 

13 

8 

c; 

>■ 

3667 

12 

13 

10 

3 

3667 

12 

9 1 

10 

9 

1 

435 

6 

8 

7 

1 

1615 

15 

8 

7 

1 

1615 

13 

10 

7 

1 

6 

1740 

70 

5 

3 

2 

5082 

213 

5 

3 

2 

5082 

213 

11 

1 

1 

0 1 

47 

22 

- 

- 

- 

- 

- 

- 

- 

- 

- 

- 

12 

8 

4 

4 I 

570 

l 64 

5 

3 

2 

2432 

352 

5 

4 

1 

2432 

352 


28 

20 

8 

868 

17 

15 

11 

4 

6205 

143 

13 

10 

3 

14291 

248 


214 

201 

13 ' 

1249 

36 

174 

154 

20 

9268 

33 

I 

167 

149 

18 

15562 

39 


23 

23 j 

0 : 

27 

6 

23 

23 

0 

27 

6 : 

1 

23 

23 

0 

27 

6 


37 ! 

33 

4 : 

783 

324 

27 

21 

6 

4559 

51 1 

26 

20 

6 

5456 

43 



2 

2 

738 

109 

h ’ 

4 

0 

738 

1 

109 I 

4 

4 

0 

738 

109 


31 

27 

4 

389 

15 

20 : 

18 

2 

3654 

2 k 

19 

17 

2 : 

6359 

3 


37 

36 

1 

143 

6 

28 

1 

23 

5 

3760 

12 '■ 

26 

20 

6 i 

4520 

9 


34 

12 

22 , 

1347 

537 

21 1 

17 

4 

8805 

1117 

20 

15 

5 

8805 

1117 

21 

29 

5 

4 

i 

305 

28 

18 ' 

13 

i 

5 

1 

! 

4742 

1 

12 

18 

13 

5 

4742 

12 
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Table 27 (a) - Continued 




G = UOO; b 

= 133 



G = 3600; 

b ^ 1200 



G = 4800 ; 

b = 1600 



# of 
burst 

# 

£ b 

# 

> b 

max. 

burst 

■ 

# of 
errors 
in max. 
burst 

# of 
burst 

# 

< b 

# 

>b 

max. 

burst 

■ 

# of 
errors 
in max* 
burst 

# of 
burst 

# 

< b 

# 

>b 

max. 

burst 

# Of 

errors 
in max. 
burst 

22 

102 

39 

63 

2506 

535 

55 

33 

22 

13718 

906 

50 

31 

19 

22459 

979 

23 

!+7 

37 

10 

5157 

346 

27 

17 

10 

6178 

367 

27 

18 

9 

6178 

367 

2 k 

18 

17 

1 

518 

236 

16 

l 4 

2 

2126 : 

12 

16 

l 4 

2 

2126 

12 

25 

213 

186 

27 

1265 

444 

183 

169 

i 4 

14761 i 

105 I 

180 

164 

16 

14761 

105 

26 

109 

87 

22 

1944 

295 

89 

76 

13 

6355 

162 

86 

72 

l 4 

1 1167 

289 

27 

21 

1 

2 

611 

286 

19 

18 

1 

2464 

6 

19 

18 

1 

2464 

6 

28 

13 

i 

L- 

9 

4 

1758 

211 

I 

11 

10 

1 

5588 

223 

11 

10 

1 

1 

5588 

223 
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Table 27(b). Optimal guardspace, 50 kbps wideband data. 


■ 


n 

b = 133 



G = 3600; 

b - 1200 



G = 4800 5 

b = 1600 


1 

# of 
burst 

« b 

# 

>b 

max* 

burst 

# Of 
errors 
in max. 
burst 

# of 
burst 

# 

b 

>b 

max. 

burst 

# of 
errors 
in max. 
burst 

# of 
burst 

^ b 

>b 

max. 

burst 

# of 
errors 
in max. 
burst 

1 

34 

17 

17 

473 

248 

30 

27 

3 

378 

80 

28 

25 

3 

7018 

112 

2 

13 

4 

9 

2008 

681 

11 

8 

3 

4651 

98 

11 

8 

3 

4651 

98 

3 

15 

6 

9 

302 

193 

12 

10 

2 

3570 

160 

12 

10 

2 

3570 

160 

4 

2 

0 

2 

l6i 

23 

2 

2 

0 

161 

23 

2 

2 

0 

161 

23 

5 

3 

2 

1 

210 

95 

3 

3 

0 

210 

95 

3 

3 

0 

210 

95 

6 

i 2 

1 

1 

266 

61 

2 

2 

0 

266 

92 

2 

2 

0 

266 

92 

7 

2 

1 

1 

176 

90 

2 

2 

! 0 

176 

90 

2 

2 

0 

176 

90 

8 

2 

0 

2 

346 

i42 

2 

2 

0 

1 346 

142 

2 

2 

0 

346 

142 

9 

3 

' 2 

1 

319 

65 

3 

3 

0 

319 

65 

3 

3 

0 

319 

65 

10 

21 

9 

12 

326 

! 143 

19 

18 

1 

3507 

155 

19 

18 

1 

3507 

155 

11 

17 

7 

10 

687 

160 

12 

9 

3 

3506 

332 

12 

9 

3 

3506 

332 

12 

100 

50 

1 

1 

50 

13636 

! 2361 

27 

16 

11 

70931 

5603 

26 

16 

10 

1 

124y44 

10477 

1 




So our criterion for choosing a G which is optimum for a code C 
with rate R is to choose that G for which the lengths of a desired 
proportion of the bursts are at most b bits where b ^ G. We cannot 
allow G to be too large even though theoretically that would enable us to 
correct all the bursts. The compromise is to find an implementable code having 
a desired R and then a G giving a maximal b for that code and for the 
frame size (or in5>lementable multiple thereof) on the channel. For the GCF the 
frame-size is 1200 bits. 

We do not want to leave the reader with the in5>ression that b can always 
be used as criterion of the effectiveness of a code against burst noise. For 
example, on a channel where long bursts containing relatively few errors are far 
more likely than short bursts containing many errors one would prefer a code 
capable of correcting the likely longer bursts at the expense of the less likely 
short bursts. 

As an illustration of the magnitude of G we are talking about I have 
fixed a b = g/ 3 and then found the value of G for which a high proportion of 
the bursts have lengths less than or equal to b. The results are contained in 
Tables 27(a) and (b). In both the high-speed and wideband circuits a guardspace 
of UOO bits is too short for identifying bursts. It does seem however that in 
both cases a G of 36OO bits is adequate. For this value of optimal G the 
bursts are longer in the 4800 bps data (maximal burst length is 217,362 bits) 
than in the 50 kbps data with a maximal burst length of 70,931 bits. 

We now find the average length of the bursts, the density of errors within 
them and how many 1200-bit blocks are affected each time the channel enters a 
burst. Specifically we shall calculate the 
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( i) distribution and mean of burst lengths 

(ii) P(k errors in a given burst of length n) and its mean 

and 

(iii) the block burst distribution. 

i. Distribution and Mean of Burst Lengths 
Let 


L(n) = P(burst of length n) . 

Then as shown in Proposition 1 of Appendix VI, L(n) is given by 


L(n) 


1 0 for 
|U(G)L(n); 


n < 0 
n > 1 


(i^6) 


where 


_ min(G-l,n-2) _ 

L(n) = S V(;e)L(n-i!^l); n s 2 

L(l) = 1 


and UCk), V(k), k s 0 are given by (11). 

Typical test runs give too few bursts to make comparison between the model 
and empirical distributions realistic. For instance for G = ^<-00, 15 of the 29 
error runs at 4800 bps have less than 30 bursts each; for G = 3^00 bits, 19 of 
the runs each has less than this number. It therefore seems that the appropriate 
burst distribution to compare with the data is a variant of (46) which is a func- 
tion of the total length of bits transmitted in the particular run. Nevertheless 
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comparative graphs of model and empirical distributions are plotted (typical 
ones are shown in Figures 28, 29 and 31). 

The mean burst length is given by 

S nl(n) = U(g) S nL(n) (4?) 

n^l n&l 


= U(G) 


d-V(,a))^ 


and the variance is 


U(G)R'’(1) + U(G)R'(1) - [U(G)R’(l)f ; 


(48) 


where 


1 -^1 


C. 




and, R'(l) and R"(l) are" given by expressions (VI. 13) and (VI.lU) in the 
Appendix. 

The wide variation in burst lengths in each of the error runs (between 1 
bit and the maximum lengths shown in Table 27(a) and (b)) explains the relatively 
low mean burst lengths and high standard deviation in each of the error groups 
(Table 28). It also explains why the (model) standard deviation is higher than 
the mean, it being possible to have no binrst at all in a run (indeed) 
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Jog P {BURST OF LENGTH AT LEAST N) 



BURST LENGTH N, bits 


Fig, 24. Distribution of burst lengths C4.8 kbps linej 
Red group; G = 400) 


JPL Technical Memorandum 33-699 


119 




Fig, 25, Distribution of burst lengths C4,8 kbps line; 
Amber group; G = 400) 
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log P (BURST OF LENGTH AT LEAST N) 



Fig. 26. Distribution of burst lengths (4,8 kbps line; 
Green group; G = 400) 
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log P (BURST OF LENGTH AT LEAST N) 



Fig. 27. Distribution of burst lengths (50 kbps line; G = 400 j 
error rate = 0.52 x 10~^ ) 
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log P (BURST OF LENGTH AT LEAST N) 



Fig, 28, Distribution of burst lengths C4.8 kbps line; 
Red group; G = 3600) 
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log P (BURST OF LENGTH AT LEAST N) 





lo9 P (BURST OF LENGTH AT LEAST N) 


P 



BURST LENGTH N, bits 

Fig, 30, Distribution of burst lengths (4,8 kbps line 
Green group; G = 3600) 
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Fig, 31, Distribution of burst lengths C30 kbps line; G 3600; 
error rate = 0,52 x 
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there are two such runs without errors). The discrepancy between the data and 
model mean burst lengths can be explained in terms of the modem fixed errors 
we talked about earlier; note that the data values are therefore expected to 
be higher than their predicted values (as Table 28 shows). 

Table 28. Mean burst length, i<-800 bps circuit, G *= toO bits 



Data 

Model 

Group 

mean 

length 

mean 

length 

standard 

deviation 

Red 

389.0 

340.0 

402 

Amber 

94.0 

70.0 

82 

Green 

50.0 

4i.o 

4l 

Average 4800 bps 

185.7 

135.0 

177 


As Tables 27(a) and (b) show, most of the bursts occurring are very short. 

In both the 4800 bps high-speed and the 50 kbps wideband data, a high percentage 

of the bursts is less than 133 bits in length (using a guardspace of 400 bits). 

We note here also the effect of the modem fixed errors which, at this value of 

G, gives a large number of bui'sts exactly 24 bits in length (each random error 
causes errors in exactly l8 and 23 bit positions away from it) and containing 
exactly three errors , 

ii. Distribution of Errors in a Burst and Its Mean 

Using a guardspace of only 40 bits it has been shown in [l] for the 4800 
bps data that the ratios bad/good bits in the bursts average to 4l'5&, kkio and 45fa 
for respectively the Green, Amber and Red error groups with overall average of 
about 44'jS. Let us see here how the guardspace affects this error density in the 
bursts . 
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To do this we find the distribution of errors in a burst. Then by 


Proposition 2 of the /appendix we have 


P(k errors in a given burst of length n) = 


I(n) 


where 


Q(k,n) 



0 if k = 0, n ^ 0; or k > n 

1 if n = k = 1 

min(G-ljn-k) _ 

S V(i!.)Q(k-l, n-X-L); n s k s 2 


^0 


(^ 9 ) 


and L(n) is given by (^6). 

The mean number of errors in a burst of length n, K^, is given by 


1 if n = 1 or 2 

n _ _ 

Tj L(k)L(n-k+l) 


L(n) 


(50) 


\'le pick an n = 2k00 and plot the graphs of the probability of errors in a 
burst of length n for G = 400 and 360 O bits and each of the error groups. 
Typical plots are shown in Figures 32, 33 and 34. The mean number of errors for 
this length of burst and the bad/good ratio are shown in Table 29 which also 
shows that the bad/good ratio decreases with increasing guardspace as should be 
expected . 
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NUMBER OF ERRORS N 


Fig, 32, Distribution of errors in a burst {averaged 4 
line; burst length = 2400 bits; G = 400) 
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log P (NUMBER OF ERRORS IN BURST OF LENGTH N) 




NUMBER OF ERRORS N 


Fig, 34 t Distribution of errors in a burst C^veraged 4.8 kbps line 
burst length = 2400 bits; G = 3600) 
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log P (NUMBER OF ERRORS IN BURST OF LENGTH N) 



Fig* 35, Distribution of errors in a burst (overall 50 kbps 
line; burst length - 2400 bits; G = 3600) 
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Table 29* Mean number of errors, in burst of length n ^ 2^00 


Group 

G = UOO 

G = 3600 

K 

n 

Density 

Bad/Good 

K 

n 

Density 

Bad/Cood 

Red 

239 

0.01 

0,11 

132 

0.055 

0.058 

Amber 

25^ 

0.11 

0,12 

36 

0.015 

0.015 

Green 

371 

0.155 1 

0.183 

22 

0.009 

0.009 

Av. U800 bps 

1 211 

0.088 

0.096 

68 

0.028 

0.029 

Av. 50 kbps 

' 302 

i 1 

0.126 

i 

O.lUU 

199 

0.083 

0.09 


iii. Block Burst 

Lastly we ask for the distribution of block bursts • By block-burst 
mean, a string of blocks starting and ending with an error block, separated from 
the nearest preceding and following error block by a gap (of error- free blocks) 
of not less than the block guardspace and containing within it no gap equal to 
or greater than the block guardspace. 

If d = block guardspace, L (n) = probability of a block- burst of length 
n (each block is s bits long), then as shown in the Appendix 



0 for 
U®(d)L®(n); 


n < 0 
n ^ 1 


(51) 


where U^(d) and 
Taking S ^ 
find that there is 


L""(n) are as given in ( 152 ) and ( 153 )• 

1200 bits and using a block guardspace d = 10 blocks we 
chance of getting a block-burst extending to 10 blocks or 
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more in the Red error mode. This probability reduces to less than in the low 
(Green) error mode. Graphs of this distribution for the different error modes 
are shown in Figures 36, 37 and 38 . 
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NUMBER OF BLOCKS IN A BURST, N- — ► 

Fig. 36. Block burst (4,8 kbps linej Red group; guardspace - 10 blocks) 
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log P (BURST OF AT LEAST N BLOCKS) 



} 20 40 60 


NUMBER OF BLOCKS IN A BURST, N < 

Fig, 37, Block burst C^.8 kbps llnej Amber group 
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Figt 38, Block Burst C4t8 kbps linej Green groupj guardspace = 10 blocks) 
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Section VIII 


CONCLUSION AND REMAINING PROBLEMS 


Although we have demonstrated that the simplified five-state Markov chain 
(Figure 3) can be used successfullj'- to model the errors occurring on the GCF 
we still want to insist that a more realistic model should incorporate the 
varying line conditions caused by users coming onto or dropping off the line 
at different times. As indicated in Section I this can be done by placing 
appropriate probability distributions both on the varying number of users on 
the channel which causes the changes in line conditions and the times between 
these changes. 

In oi;r model we have taken note of these line changes by dividing the wide 
variations in bit error- rate into three main error groups; the Red, Amber and 
Green modes. All statistics calculated for each group are valid for such a 
group, as it has teen shown in this study. That is why the emphasis has not 
been on the averaged channel parameters for either the high-speed or the wide- 
band circuit since they cannot be expected to depict all the three error groups 
with any measure of reliability. Our success in modelling rather accurately not 
only the individual test runs but also groups of runs gives us a lot of confi- 
dence in all the predictions based on those model parameters. 

Of course the problem still remains as to which of the Red, Amber or Green 
group of statistics to employ at any given time. Our answer? You guessed right’. 
Be conservative, take a pessimistic view of the channel by using the Red- group 
model statistics in designing all error correcting (and detecting) schemes not 
only because of the protection it provides but also because this is the error 
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group that our model depicts most accurately* In our model, the higher the 
error-rate the closer the agreement between the model and empirical statistics. 

This is as it should be since we are interested only in the bursty mode of the 
channel . 

Statistics, other than those displayed in these pages, exist for estimat- 
ing the performance of all error correcting (and detecting) codes and for construct- 
ing feedback (and retransmissiot^ schemes. Indeed work using some of these sta- 
tistics has already begun. 

We still want to think that the highlight of this study is not the con- 
struction of the five-state Markov chain as model for the GCF. Rather it is the 
development of the Maximum Likelihood procedure for estimating the model param- 
eters using an iterative method. This procedure, as has been shown, is applicable 
to any finite Markov chain. 

It is hoped that the reader has not been left with the impression that all 
the problems attendant to the model have been solvedi Aside from the problem of 
constructing a model with the fluctuating line conditions built into it along the 
line sketched above, there are other problems of both theoretical and practical 
interest that are not totally solved yet^ We list a few below. 

1, One difficulty we have with our model is finding a physical justification 

for allowing only one error-state B in which errors occur with proba- 
bility one. It is fairly well-known that the error-causing mechanism 
does not reverse the bit each time the channel enters into a burst. Our 
explanation was that a bursty state is represented by transitions between 
states B and sojourn in either of them being allowed. A direct 
way to model what happens in the physical channel would be to allow two 
or more states in which errors can occur. Let errors occur in one of the 
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states with probability 1/2 (given that the state is reached) to account 
for those times when, in the midst of otherwise good transmissions, 
random errors occur. The other error-state is the burst state in which 
errors occur with some probability 0 < h < 1 to be considered as one 
of the channel parameters to be estimated. 

The problem is that the expression (84) for the capacity of the channel 
is valid only for the case of one error state. Although (74) is true, 
in general, the capacity cannot be evaluated directly from it. The 
problem of finding analytic expression for the capacity when there is 
more than one error state is still open. 

Our attempt has been directed at finding bounds on the entropy H in 
(74) by finding upper and lower bounds on the function h(z^^ , . « . ,z^) in 
(80) and showing that these bounds are close enough for large n. 

2. Blackwell and Koopmans [8] showed that for a 4x4 irreducible aperiodic 

Markov matrix M satisfying some mild conditions, the function P(M,z) 
in (10) of the error sequence z - [zn) fixed (the same) for 

different matrices M. In the terminology of our model this means that 
if we had used a four-state Markov model satisfying the specified condi- 
tions it is possible to have different sets of Maximum Likelihood estimates 
of the model parameters giving the same joint probability distribution, 
F(M,z), for the same error sequence z» Then our interest in knowing all 
such transition matrices becomes apparent. For two seemingly different 
sets of estimates may indeed be equivalent or two different test runs may 
be recognized as two samples from the same underlying distribution. 


JFL Technical Memorandum 33-699 


141 



Blackwell and Koopmans found a finite set of functions f, , • • ♦ ,f, 

1 ^ k 

each defined on the set of all irreducible aperiodic Markov matrices M 
such that and are equivalent (in the sense that z) = 

P(lip, z)) if and only if functions f they call 

A COMPLETE SET OF INVARIANTS . For the 4x4 matrix there are only eight 
quite- easily-checked such functions (probabilities). 

Although the conditions for their result do not all apply to our case 
there is enough structure in our model to enable us to find the complete 
set of invariants. VJe shall return to this problem in a seperate paper. 

3 . The buffer problem in feedback retransmission . 

To people familiar with digital communication this is not a new 
problem. It has been considered in different forms by different peoplej 
indeed a number of schemes are now being developed to reduce the buffer 
sizes both at the transmitter and the receiver when feerjback retransmission 
method is employed for error correction on the CCF. Ne describe one such 
scheme here. 

Imagine we have, along with our channel, a reverse (feedback) link, 
from the receiver to the transmitter, of low capacity available for our use. 
Let data be transmitted along the channel at constant rate R, data being 
supplied to the transmitter at rate < R. The receiver delivers the 
received data blocks in sequence j each time a block is received with an 
error in it the receiver sends to the transmitter through the feedback link 
a request for retransmission. A copy of the error block and all subsequent 
blocks received are stored in a buffer at the receiver until the error block 
is retransmitted and received correctly. This is how the receive buffer 
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fills up. Let T be the loop time, i.e., the time for a trans- 
mitted block to reach the receiver and a retransmission request relayed 
to the transmitter if the block is received with an error in it. It 
therefore follows that all the blocks entering the transmitter, at 
constant rate R^, after each block is transmitted will have to be 
retained at the transmitter buffer for at least time T by which time 
the transmitter will know whether or not the block has passed through 
error-free. We assume the requested retransmission is done as soon as 
possible after the request is received. 

During long block-bursts when successive blocks are hit with errors 
many blocks of the incoming data may have to be stored at the transmitter 
buffer. On the otherhand at low block error-rate when only a few blocks 
are received in error or when a single block is received in error after 
repeated retransmission, the transmitter buffer does not need to store 
more than just a few blocks while data is piling up high at the receive 
buffer . 

The problem is to find the distribution of the mumber of blocks that 
will be stored in the buffers at the transmitter and the receiver, using 
our model statistics for the channel parameters. Or equivalently, we may 
fix a buffer length at N blocks, say, and ask the probability that the 
number of blocks that have to be stored will exceed N, 

While transmission is going on, good data are being delivered to the 
user in sequence as they arrive at the receiver. If this cannot be done 
because of requested retransmissions of blocks received in error, the user 
can wait for a maximum of 8 seconds. After this time all the data in the 
receive buffer, both good and bad blocks, are delivered to the user. 

What is the block error-rate of the data eventually delivered to the user? 
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Appendix I 


In this appendix details of the maximum likelihood estimation procedure 
used to obtain the optimizing parameter set from the data will' be outlined. 

The steady state probabilities of the finite state Markov chain (M*C) 
with transition matrix M is given by i = 1, 5 where 



and 


C. Uc 

u = • 

i * 

1-p. 

1 


i * 1, 4 


u 


5 








Since errors occur only in state B, the bit error probability is given 


P 


1 



The fact that 
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k ^ 0 


U(k) = 


4 

2 


k-^1 

c.p. 

11 


implies 

h c, 

S ^ = iXo) = 1 . 

i=l 


Wow let us show how to get a MLE of Pj^, say, from the data. Suppose, 

as we did in (7), we assume that for the process to be in state the length 

of the gap is at least k bits (G, is the best error- free state). Then since 

o x 

U(k^) = P(0 °|l) 

= S V(k) ; V(k) = P(0^l|l) 
kSk 

o 

the conditional probability of getting a gap of length k 3 k^ is given by 


where 


l&L . opi'-i 


uny 


c - 




k -1 

' '"ih 


Suppose there are I gaps of lengths k^, k^, k^ such that 

kj^ £ k^, i = 1, i. Then the joint conditional probability of getting the 

i gaps is given by; 


JPL Technical Memorandum 33-699 


147 






which implies 


I 

S k.-& 

1 " ° 

a 

S +i 
1 " ° 


( 53 ) 


148 


JPL Technical Memorandum 33-699 



So in general 




(5M 


where k.., k. , N., and N,. are, as defined in Section on Parameter 
ji’ i’ 11 ’ 

estimation, given by: 

kj^ = length of gap j, j = 1, in state i, i = 1, 

k. = the threshold to state i 
1 

N = number of errors in the run 
e 

and = number of gaps of length zero. 

Suppose we are given a sample of size K of observations from a finite state 

M*C ' in the form of an error sequence Z = fz ] Our aim is to determine 

^ '■ n^n=l,W 

the transition matrix M from the sequence Z. The general form of the following 
procedure was proposed by Baum and Welch in [7] but in that paper no mention was 
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made of a way to get the initial estimates with which to start their iterative 

A 

method. Our iterative method for getting the matrix M will use the and 

A 

obtained above as initial estimates. 

The probability of getting the sequence z^, z^ (using the 

structure in the 31 samples) if M is the transition matrix can be written as; 

Ne-1 

p(M, z) = p,u(je)u(L) n V(ji ) (55) 

j=l ^ 


subject to U(0) - 1. 

(Our notation here is as used in the section on Parameter Estimation.) 

Let denote the true value of M. We would like to take as our esti- 

A ^ 

mate M of M a value of M which maximizes P(M, Z) . It is possible for 

A 

distinct M to yield this maximum (see Blackwell and Koopmans [8]). But we shall 

A 

content ourself with getting any one of such M. 

Now, the M maximizing P(M, Z) also maximizes 


Ne-1 


U e~ - 1 

log — = log S C pf"^ + log S C.p^^ 4 S log S c, (1-p. )p.'^ 


i-1 


1 1 


1 1 


(56) 


j=l 


subject to 


u(o) = S — 

P. 


1 . 
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Put 


c. 

1 


Cl (1-Pi) 




and let 


c.p^ 

1 i i-Pi 


c.p^ 

X -^1 


- J 

O'. = Z) c.p. 

J ^ i 1 


( 57) 


Then ( 56 ) can be written as: 


N^-1 


log = 10g(X + lOgOL + Z> logo*. 

^1 1 L j 


( 58 ) 


. 

subject to the condition G ; 2-=-^ = 1. 

~ 1-p. 


Using method of Lagrange's multipliers we have 


V log 


P(M,Z) 


X V G 


where V is the differential operator. 


That is 


4.iog£fei 



O’, 


+ 





We-l 

S 

j 



O'. 



(59) 
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and 


ic,(l-P,)pf^-^OjPf ^ Lc,(l-Pi)pr^^OA . V . Vl 


L”X — L 


je.-i 
^ ^ j 


apj Pj, 


(i-Pj) P'1 


(i-Pi) P'1 


J O'. 
J J 


= X 


(X-Pl)' 


(60) 


(59)xSc^ gives 


X = + 1 

e 


(61) 


Dividing (sg) x by (6o) x we get 




C.p" 

11 


^i^i' 

+ s ^ ^ 


(1-piK 1 "'J 

... _. T77^ 

(l-p^)^Q'^ (l-p^)^a^ j ^ "j 


1-P. 


Xc.pf 


- jJ+1 
c.p. 
11 


Lc.p^' 

1*^1 


o 

C.p. 

1*^1 


(62) 


Denote the ntunerator and denominator of the LHS of (62) by and 

respectively. Then from (62) we have 


Pi 




(63) 


and 
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D, 




Since (59) x using (6l) is 


D 


1 




we can write for using (63): 

- 

°i ' (Ng+l)(C^+D^) ' 


(64) 


The expressions for and in (63) and (64) are the ones to use to 

iterate to get the optimum values of these parameters using as initial values 

the raw estimates obtained in (54). 

To get a single transition matrix M we consider the n runs as 

independent samples from the channel with transition M. Then the joint proba- 

n 

bility P(M, Z) of getting the sequence , Zp, z , where N = S N. is 

Id « j^l 


the total sum of bits in all the sangjles, is given by: 


P(M, Z) 


' ^1 n, 

k-l 



U(0) = 1 . 


(65) 
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The notation here is as used for equation(l7) • Our aim here also is to 
maximize P(M, Z). Using the method of Lagrange’s multipliers subject to the 
condition 


1 = U{0) 

i 

vre obtain 



and 


c. 

1 



c, 

1 


i-Pi 


(66) 


where = number of errors in sainple k 


n 


D, = 


2: 

k==l 


- "^Ik 
"i^i 


- ^ 
c.p. 

jti ~jk 


"ek-^ o.p6'' 

S " ' 


= 


n 

s 

k=l 


- ^Ik - \k"^^ 

^Ik^Pi . ‘^iPi 




- V- 


c,p, 

11 


“ek-^ 


c.p. 

D i. ^ ^ 


jk 


(T-p.Ta" 7 , "\2 ri-P~)c^iv ■" 7 ; ^2 ^jk 

(1-Pj^) of^ ' Ik (1-p^) 0=1 Ok 


and 


D = ^1 + ’Dg . 
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Appendix II 


We shall indicate here the method used to find the n-step transition 
matrix of the channel M. Here also, as in Reference [2j, we shall follovr 

the method in Feller [17] ♦ 

Let X and Y be the right and left eigen vectors of M with eigen value 
\ f 0, s = That is 


sMX = X (6?)i 

sYM = Y (6?)ii 


X = 



Y *= (y^, y^,, y^) 


Writing (i) and (ii) out we have: 


X. 

1 


s(l-p^) 

1-sp. 


i = 1, 


Xr- = 


s S c.x. + sqx^ : 
i=l ^ ^ 5 ’ 


- 1 - S 

i=l 




SC, 

1 

1-sp^ ^5 


y^ = s S (1-p )y + sqy 
^ i-1 ^ 


( 68 ) 
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Since x^(y^) is determined up to arbitrary multiplicative constant, 
we may put x^(y^) = 1. (Note that x^(y^) f 0, for that would make x. = 0 
for any sj also that (i) and (ii) have the same eigen values. So it suffices 
to find these solutions using ( 6 ?)i). Then 


1 - S C.S -s 

1 1 -sp, 


i=l 


+ s(i - i; c. 
\ i=l 


(69) 


Substituting \ = - we can write ( 69 ) as; 


X - 1 = S c. 

i=l ^ 


1-X 


x-p, 


(70) 


which shows that X = 1 is a root. Removing this root we have left; 


^ c. 

E 3 ^ + 1 = 0 

i=i 


(71) 


or 


■I 


3 


> X^-l E P. - 


U 

s 

i=l 


S cj . X 


1 2 

— (PpP3+P2Pl,+P3P4) + ^(PiP3-^PiPU+P3P4) 

) 




* ^ ^ 

Wu * 5; W. * 5; Wu " 5;; f 




= 0 . 
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Hence X = 0 is another root; and if we write the cubic equation as 


^ A p A A 

aX^ + bX^ + cX + d = 0 


standard method for solving cubics gives the three roots x^, as: 


- |(A - B)^ + (B + A)^ I + I 

||(A - B)^ + (B + a)^| + |(B + A)"^ - (A - B)^ 

||(A- B)^ + (B + A)^| - |(B + A)^ - (A - B)^j 


|J(A- B)^ + (B + A) 


(B + A)-^ - (A - B)‘ 


aO a*2 

<3 p-^ 

^ 2 ’ ^ T 27 


A A 

5 be . 2b'^ 

q = d - -y + yy ; p = c 


a = 1 


^(PjPj+PjP^+PjP^) + ^(p^Pj^p^P^^PjPl^) * 5^(P3,P2 +PiPi,^P2Pu) 


d = - ~ 


+ ~ 
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u)| cr> 



So the eigen values of ( 6? )i are 


X, - Xj 0^ x^j x^j 


Now (68) in terms of X becomes: 




1-Pi 


M) _ ^ 


T 


Xj -1 


Y, =I 


i . 1, U ; r * 1, 5 


Let Then 


,(n) _ V + 

'dk - A ^k V 


(73) 


where 


1 = t S 


v=l 


or 


t 

r 


1 + 


4 

s 


c. (1-p. ) 
1 1 


i-l 


i=l 
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If we write 


t = t 

® ^ \ -1 
r 


and for r = 1 , 3 j ~ * ^r “ ^ 1 ’ ^ 2 * ^3 


and 


t = 
o 


r h c. 


(- V 


' *rl\ =0 “ 


4 c. 

1=1 Pfj 


,-l 


3 t x" 
— r r 


- l-p^ ‘o - Pj’ (V-PjMV-Pk) 


= 1» ^ 


t X 


n 


^J5- *0* (1- 


tk" 

r r 


(n) ^ + j. « 

^54 “ 1-P^ o k ^ (V'Pj 


>) = t + S t X" 

55 o r r 

r 
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Appendix III 


Proposition 

The capacity, C, of the hurst-noise channel is given by 

C = 1 - H 


(74) 


where 


H = 


lim S P( 2 , 
n-«» Zj^-Oorl 


• • • > 




• * * > 




Proof ; The proof is by classical information- theoretic arguments. The mutual 
information of the n-extension of the channel is ; 


I(x", 'f-) ^ H(Y^) - HCY^Ix*^) 

where x(y) is the input (output) and H(») is the entropy function. The 
transmission rate is then 


B = 


lim 

Xi-xn 


I(X^.Y^) 

n 


and if p(x) is an input distribution, the capacity C is given by 


(76) 
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C = max R 

p(x) 


= max lim 
p(x) n-*<» 


H(Y^) _ HCY^'Ix”) 
n n 


Now for additive noise, i.e. Y = X + Z, Z the noise sequence, we can 
show easily that 


H(y")x'^) = H(Zj^, 


z 


n 


) 


independent of x'^ and that for p(Xj^, 
maximum equal to 1. But 



p-n H(I!1 
» n 


achieves its 


H(z 

lim 


9 * * 

n 



lim H(z^|zj^, ...» z^_^) . 

n-*OT 


So by (77), (78) we have 


C — 1 - lim H(z^|z^, •••, 

n-*» 

= 1 - H 


where H denotes limH(z^Izj^, ..., an^d 

n-*® 




z.=Oorl 
1 
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Now let us write H as 


with 


N lim f«a««z )hf z* § • » * * z ) 

f„ „ t 1 ' n' ' 1' ’ n' 

ri-Ko iZj^j • . . 


h( Z- 


,z ) = 
n-' 


z .,=Oorl 
n+1 




t„)logP(z^^|zj^,. ..,z^) 


(80) 


If we assume that our model has only one error state B then we can show, 
see Gilbert [^3, that h(z^, z^) can assume only (n+l) values 


h(o"), h(lo”’^), h(lo""^), h(10), h(l) (81) 

where {10 , k = 0, ,,,, n] is the event that an error is followed by k 
error- free bits. Using (80) and (81) we can write 


H = E P(10^)h(10^) . 
k=0 


( 82 ) 


In terms of U(k) and V(k), P(l0*^) is given by 

P(IO^) = Pj^U(k) 


and hence 
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io.(i . 


and 


H - 


k“0 


u(k). 


I u(k+1) 

U(k) 


, U(k+1) 


(i - - 


U{k+1) 

U(k) 


(8U) 


Note that H can also be written in terms of V(k) viz; 


V(k) = U(k) - U(k+1) 


CO 

H = - P, S V(k)logV(k) 
k=0 


( 85 ) 


although we shall not use this form in our calculations. 
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Appendix IV 


Block-Bit Statistics 

Expressions for the following statistics will be given in this section; 

(i) Proportion of blocks in error. 

(ii) P(k,n); probability of k errors in a block of length n. 

(iii) P^(k,n); probability of k errors in n Information digits 

"th 

on our channel sampled at every tr^ step, 

(iv) P(k bits between extreme errors in n-block|^ 2). 

We shall get (i) as a special solution to (ii). 


(ii) P(k, n) = P 


"^1 '^2 "^ T-l 

0 ho ••• 0 ^ ^10 


0 ^ ^ n k 


n-k 


VO 


P(0 -^1)P 


■^'3 h 1 h 

0 10 -^1 0 ^[l 


> ^ 


( 86 ) 

= n-k 


Now 


p ho h 0 ^|i 


n- ^^-1 ■ 


n-j6,-k . 

1 / Jl. 


= S 
^2=0 


(o 


^■3 ^ h- 

0-^1 0 10 ^jl 


N-je^-i- 


(87) 


where N = n-£^-l. Denoting the LPiS of (8? ) by P(k-1, n-jJ^-l), we have: 


n-i^-k 

P(k - 1, N) - D V(je^)P(k - 2, N - - 1) (88) 

^2=0 
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So from (86 ) ; 


P(k, n) = 


n-k _ 

S U(AjP(k 


- 1, n - 


\ - 1 ) 


(B9) 


where 


, n-^, -k 

n-k 1 _ 

= P^ S S U(jJjV(je )p(k - 2, -2) 

^ 1=0 ^ 2=0 


^1 = 


4 c, 

1+ s 


■ ^ i-p. 

1=1 -^1 


Method of calculating P(k,n) from the recursion (89 ); 

Note that ( 89 ) and ( 88 ) imply that it suffices to know, for every 

k & 2, 


{P(k-2, n-j), j - 2, ...» n-k+2; k £ n}. 


( 90 ) 


That is, it is enough to know 

{P(0, n-j); j = 2, n} , 


( 91 ) 


But 


P(0, n) = U(n) 


4 

= S 

i=l 


n-1 

c*p, 

11 


(92) 
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To find P(k, n) for k = 0, 1 observe that 


P(0, n) = 


j-1 ■> •> 


u . = 


1-P-, '^5 

tJ 


= P, S 


^ C . , 

J _ P >>-1 




U,^ = 


• 


Hence p(error block) = 1 - P(0, n) . For k = 1 (86) becomes; 


(93) 


n-1 _ 

P(l, n) = E U(i5.)P(0, n - jU, - 1) 

^ro 


and using ( 92 ) we have 


n-1 

P(l, n) = P S U(il )U(n - - 1) (9^) 

^ £^=0 ^ 


(iii) 


P^(k, n) 


Given the matrix 
reduces to (ii) with M 


- the t-step transition matrix, the problem 
replaced by Let 


U^(k) = P^.(0^(l) 
V^(k) = P^(o\|l) 


( 95 ) 
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be the gap statistics w.r.t. V?e want to find expressions in terms of 


for Write 


"k = "ill’ ! 


i = 1 , U . 


Then 


k 

U.(k) = E G (k) (96) 

t t,i 


Let us write 


M 


- (^10>5X5 


and M as the 4 x U matrix obtained by deleting the last row and column of 
Thus M are the t-step transitions between the good states only. Then 




" ^^51» ^52* P53* ^54^^ 


( 97 ) 


It is appropriate to note here that (97) is true in general whenever we are 
interested only in a sequence of error-free transmissions starting with an error. 
The only part of the original transition matrix to use is that denoting transitions 
only between the good states. Thus in calculating U(k) and V(k) w.r.t. the 
basic transition matrix M, we used the transitions between the good states 


P = 



0 

P, 

i 

0 

0 
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giving 



If then 

G^(k) - P(0^, Sj^ = gJi) 

4 

u(k) = S G (k) 

i=l 


we have 


(G^(k), GgCk), G^Ck), Gj^(k)) = (c^, c^, c^, 


i .e. 


U(k) = S c . 

i=l ^ ^ 

Wow 

P^(k, n) = P^jo ho^l •'♦ 0 ^“^0 

n-k Z (I Z . Z ) 

= S P. (0 H)P <0 ^1 ••• 0 ^ 10 ^|l> ( 98 ) 

Z^=0 ^ ^ ) 

L ^ k + 1 
0 ^ S n - k 

S £ . = n - k 

J 
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As for P(k, n) we find that 


P^(k-1, n-A^-1) 


= P. 


Z. ^ Z I 
0 ^10 • •• 0 ’^■■^10 ^lll 


n- Z^~ 1' 


n-j2^-k 


= S V^(je2)P^{k-2, n-Z^~Z^-2) 




and 


n-k _ 

P^(k, n) = P^(l) E U^(«^)P^(k-l, n-kj^-1) 


, n-k- A, 
n-k 1 


= P^(l) S S n-Z^-i^-2) 


Z^=0 Z^O 


(99) 


where 


v^(je) = u^(je) - u^(A+i) 


and 

P^{1) = P^(B) = Pj_ . (100) 

To prove ( 100) note that since 

iiM^ = = ♦*• = n 

where n = (Uj^> is the steady state distribution for M, II is 

also the steady state distribution for t S 1. Thus 
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P^(B) = 


4 

1 + S 

i=l 


= P, 



To evaluate ( 99 ) it is enough to know- 

P^(0, N) for N = 0, 1, . n . 


But note at once that 


P^(0, N) = U^(N) . 


(iv) P(k bits between extreme errors in n-block[s 2 errors); 


Denote this probability by Then by definition: 


P(k bits between extreme errors and ^ 2 errors in the block) 
P(^ 2 errors in the block) 


The numerator is equal to: 
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y = n-k-?-x 


= '"s'' p(o>‘i-JU.ia'^); 
x«0 


n-k-2 


- E P(0’'l)P(zj^^j^-l|z^=l)P(0^l0''l-^l) 
x =^0 


n-k-2 

P,r(k+1) E P(0’'(l)p(0^!l) 


x=0 


n-k-2 

P-r(k+l) E U(x)U(n-k-2-x) 


x=0 


where 


and 


r(k+l) = P(zj^^^=l]z^=l) 


U(k) = P(O^Il) . 


Also 


P(^ 2 errors In a block) = 1 - P(0, n) - P(l, n). 


That is 


n-k-2 

P^r(k+1) E U(x)u(n-k-2-x) 

_2 X =0 

1-P(0,n)-P(l,n) 


( 101 ) 
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Appendix V 


Block (Symbol) Error Distribution 

It is desired in this section to find for s.vmbols in a block all 

statistics we have found for bits makinc; up each s.ymbol. In this section we 
shall used symbol to mean a fixed number of bits and a block to be made up of a 
fixed number of symbols. Specifically we shall find expressions, in terms of 
channel parameters, for 

S I 

(i) P (0 jl); the probability of k error- free symbols 
following a given error symbol. 

S k f 

(ii) P (O ijl): the symbol gap distribution. 

(iii) P (k, n): the distribution of error symbols in n-s.ymbol word. 

s n 

(iv) P (0 ): probability of error- free n- symbol word. 

(v) r^(k) = P( symbol k in error | initial s;'/Tnbol error). 

Throughout this paper we shall assume a symbol to be in error if one or 
more of its bits are in error. It is convenient to start with 

P (O) P(no symbol error) 
where we assume each symbol is of length H bits. 


P®(0) 


= P(0, N) 


= P S u(k) 
kSN 


K-1 

c,p, 

= p, 

1 i i“P. 


Therefore 
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rr p( symbol error) 


N-1 

G.p. 

= 1 - P-. Zy ^ 

1 i 1-p. 


using 


i-Pi 


+ 1 


= 1 


and 


c. 



we have; 





+ 


e - 4 ^ 

i i-Pi 



k 

E 

i=l 



+ 


U 

E 

i=l 


c. 

1 




^ Y^N-l 

1-Pi 


^1 = ^l?p.(l-p.) 


( 102 ) 


Next let us find 


P^(lO^) - P(l“ symbol in error followed by 


error- free symbols) 


= h P(^ )P(o'®‘"''-J|i) 

j=K 
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where the indicated j bits are the nvunber of bits up to the last error in the 
error symbol . Thus 


P^IO^) 


j=N M 


. p, 

i ‘ ^ P?(l-P.) 


or 


P“(10‘) = S 0^' 


Nk-1 


1-P. 


Putting k=0 in (103) gives (101) . Further 


(i) 




M Nv 

^Vi -T^ 


(103) 


(lOU) 


Hence 


(ii) 


P®(0^l|l) - P®(0^jl) - 


P®(0^l]l) - 


^Vi 


mc-1 


1-p. 


(l-pf) 

^p^l-p, ) 


(105) 
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if sl a if 

(iii) P^(k,n) = P^jo ^10 0 ^^10 ^ 


n-k 1. I K 1 

= Tj P®(0 H)P®jo ••• 0 ^ll 


VO 




(106) 


0 « ^ n-k 

S je. = n-k 
L ^ k+1 


First we show that 


P®(0^1) = P^(IO^) 


We can write 


P®(0'^1) = S p(o”^'^^i» «) 

j=0 

where the indicated (K-j-l) bits are the remaining bits after the first error 
of the error symbol. Thus 


p®(o\) 


N-1 


= S p(0®'^''^‘i) 
J=0 


N-1 
R S c.p? 

V=0 ^ ^ 


NX+j-1 


> Nv 

^1 1 Vi -Tii- 


P®(10^) 


(107) 


by (103) . 
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Now^ 


-2t T ^ 6 ^ 

^ ^ L-1^ ^ 1 I T ( V' TJ® //\ I T \t>® ) A 3 t L- 1, „ 1 1 


n-^^-1- 


p jo ^"-^10 !i[ == S ? (0 i|i)p Jo ^i--*o 10 |i 


P^=o 


■n~ •^2_'" ‘ 


Denoting the LHS by P®(k-1, n-j?,^-l) we have 


n-k - ^ 

P®(k-1, n-i^-1) = E P®(0 ^l|l)P^(k-2, n-i.-i -2) 

je^=0 


Using this in (106)and because of (107) we can write: 


tT** ^ 

P^(k,n) = P® E P®(0 ^ll)P®(k-l, n-jl^-1) 

^,=0 


That is, to evaluate P (k,n), we need only know 


{P (k-2, n-j); J =-■ 2, n-k+2; k S 2) 


For k = 2 and any L 


P®(0, L) - P^(0, l!i) 
= P®(0^(l) 


/I w, 

Se.p'^l "i’ 


1 ' ‘ l-Il 


P (0, L) = 


M Nx 

(1-P- ) 

Ec. 


(108) 


(109) 


( 110 ) 
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For k > 2, use (l08)to find P^(k-1. n-£^-l). The case k - 0: 


P (0,n) = 


p'(o") 


^ S p^(io^) 


= S p, Sc.p" 

k:>n ^ i ^ " 


M N, 

Nk-1 

1-P,- 


VO.S 

by (W) 


1 


.e. 


(iv) 


c 


(11,1) 


The case k - 1 is included in (109). 

g 

We now find r (k): the correlation of error s,'/mbols. 

It is convenient to find 


I I 

p( symbol k error- free j initial error symbol) = P (0, |l ) 

X o 


and then use the fact that 


r^k) = P"(li,llo) 

= 1 • 


(IIP) 


Wow let 


c = 





S = 
U = 


(c^j C^j 

transition matrix, 

matrix of transitions betw^een the good states only 
obtained by deleting the last row and column of M* 
M - ( last column of M) • 
column vector (hxl) of l^s- 
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Then 


0,) . k-1— 0,) 


where the indicated (k-l) symbols are arbitrary N’(k-l) bits. Hence 


P®(lo> Oj^) = £ 

i=l 

+ P(i )(0, 0, 0, 0, 


(113) 


where the indicated (N-i) bits are the number of bits up to and including last 
error position in the error-symbol, i = 1, N-1. The case i = 0, when the 
last error in the symbol is in the last bit position is shown in the second term 
on the RHS. So 







+ P^(0, 0, 0, 0, 


Pj^(C0, 1)M^^^"^^SR^“^U 


(ll4) 


where 
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Thus since 






we have by (112) and (11^): 


r"(k) 


1 - -4 (CC, 1)m”^^"^^SR^"^U 
^1 


= 1 _ (CQ,1)M^^^“^^SR^~^U 


C, 


(i-p?) 


i P^(l-Pi) 


(115) 


Let us check (llU) by putting N = 1 and k = 1, in which case the LHS 


P^(l^, 0, ) reduces to P(10) . 

O K 


If K = k = 1, then Qs 0; 

becomes: P^(0, 0, 0, 0, l)lSIU ^ P^ S c^ = P(10). 


and so the RHS 
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Appendix VI 


Burst Statistics 

a. Distribution and mean of burst lengths 

b. P(k errors in a given burst of length n) and its mean. 

c. Block burst distribution. 

(a) Let L(n) = P(burst of length n); n 1. Then by definition 


\ 1 ? 

L(n) = PjO -^10 1 


T + 

10 ^ 10^1 


o’*! 


n-1 


where 

r, t s G the guard space, 

0 £ A £ G-1 
J 

S ^. < n-2 . 

J 

Proposition 1 

1 0 for n ^ 0 
U(G)L(n); n s: 1 

where 


(116) 


(117) 


min(G-l,n-2) 

L(n) = S Y(l)Un-t-l); n^2 

jJfO 


L(l) = 1 . 
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Proof ; 


It is clear by definition that L(0) = 0. By ( 116) 


L(n) = P[0 ^10 ••• 10 \o^l|ll 


!L n a 

= E p{o\o ^1 ••• 10 ^i|i}p(o*i|i) . 

t^G n-1 ► 


(II8) 


E P(o'*'ljl) = E c = U{G) (119) 

t^G i 


and if we denote 


L(n) = P{0 ^10 10 

◄ n-1 ^ 


min(G-l,n-2) I 1. 

E P(0 H|l)P{0 ^1 ••• 10 ^l|l} 


min(G-l,n-2) _ 

E V(£)L(n-A-l) (120) 

i=0 


so that L(1) = 1, then substituting (119) and (120) in (II8) gives (II7). 

The expected (average) burst length is given by 

E nL(n) = U(G) E nL(n) (121) 

n^l n>l 
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We shall find S nL(n) by the method of moment generating functions 
nStl 

(MGF)* . 


Let us find the H(S‘ of L(n). Define 


(V(je) if l ^ G - 1 

Vg(t) = j . (122) 

( 0 otherwise 


Since 


L(k) = 0 for k < 0 we have 

^ min(r7-l,n-2) . . . 

L(n)X*^ = X Zj « T 


V(£)X^(n-A-l)X 


i=0 


= X E V-(jt)X L(n-ir.l)x’ 
i^O ® 


n -^^1 


n a 2 


(123) 


and if we define 


B(X) s S L(n)x" ; V (X) h E V (£)x^ (124) 

n=l ^ £=0 


K(X) and V (X) exist because L(n) are probabilities and V„(&) is non-zero 
only for finitely many jt’s. Then S'oinining over n in (123) we get 

— 

We are grateful to Howard Rumsey for pointing out this elementary but powerful 
way of generating the function L(n) from its recursion expression (120). 
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= XVg(X)R(x) 


(125) 


or 


R(X) = 


X 

I'XVq(X) 


S X(XV (x))-^ 
j=0 ^ 


(126) 


Thus 


S nL(n) = U(G)r’(X)| 
nsl 1 X=1 


(127) 


and 


R'(X), 

'X=l 


l+X^Vg(X) 

(I-XVq(x))^ x=i Ci-V(5(i)]^ 


(128) 


We shall need 


r"(x) = 


{1-XV^(X))^[2XV^(X)+X^V"(X)]+2(1+XS^(X))(1-XV^(X))(V^(X)+XV^(X)) 


(1-XV^(X))‘ 


giving 




(l-V^(l))(2V^(l)+v"(l))+2(l+V^(l))(Vg(l)+V^(l)) 


(l-V^(l))' 


( 129 ) 
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in finding the variance of L(n), Thus, since 


r"(1) = E n(n-l)L(n) 
n&l 


the variance of L(n) is given by; 


U(g)^ E n^L(n) 
nsi 


U(g) E nL(n) 
n&l 


= u(g)r"(i) + u(g)r’(i) - (u(g)r’(i))^ ; 


(130) 


where 


so 


Vg(x) 


0-1 , 

S v(^)x‘ 

!;=0 


k 

E 

i=l 


Pi 



^ c.(l-p.)(l-(p.x)^) 

i Pi(l-PiX) 


V,(1)=E 

1 


c. (1-p?) 
1 1 

Pi 


c. (p. X)®)-G(l-p.X) (p.x)""'-"! 


G-li 


VgCx) = E 

i 


(l-PiX)*= 


(131) 


so 
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c. 


- P? - G(i - Pi)p?'M 

1 ’*^1 


(i:i?) 


v"(x) = 2 

i 


C.(l-P. )p.{2(l-(PjK)^)-G^(l-p.X)^(p.X)^"^-2G(l-p.X)(p.X)^'^} 

(l-P^X)^ 


SO 


v"(l) 


E 

i 


c.p.[P(l-p?)-G^(l-p^)^pf^-2G(l-p.)p^’^l 


( 1 - 33 ) 


(b) As in Ref. [2], we write 

P(k errors in a given burst of length n) = (1-3^+) 

L(n) 


and state expression for Q(k,n) 

in the following. 


Proposition 2: 


' 0 if 

k = 0, n 5 0; or k > n 


0(k,n) = < 

1 if 

min(G-l,n- 

n = k = 1 
•k) 

V(f)Q(k-l,n-j?/-l) ; n 2 k ^ 2 

( 135 ) 


Proof: 


P(k errors in a given burst of length n) 

P(burst of length n with k errors) 
P (bur St of length n) 


( 136 ) 
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Denote the numerator by o(k,n). Then by definition 


where 


Q(k,n) = P{0 Ho H 10 HoHjoH} 


r, t^G; Lck-1; S^. = n- k 

J 

0-1 

0 « jC . ts min(G-l, n-k~ Hi.) 
^ i=l ^ 


Thus by (ll8), 


i a t, 

0(k,n) = U(G)P{0 Ho H ••• 10 H|l} . 

^ n-1 ► 


Let 


Q(k,n) = P(0 Ho H ••• 10 H|l} . 

< n-1 ► 

Since the (n-1) bits indicated must contain (k-l) errors we have: 


Q(l,n) - 


1 if n = 1 
0 otherwise 


and 


Q(0,n) = 0 for n 2: 0 
and it is clear that 

Q(k,n) = 0 for k > n . 


(137) 


( 138 ) 
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Hence for 2 £ k ^ n we can write from (I3B) 


_ min(G-l,n-k) jL 

Q(k,n) = E V(£ )P{0 ‘^1 10 ^l|l} 

^ ^ n-£^-2 ^ 

min(G-l,n-k) 

= E V(/)Q(k-l, n-jg-l) (139) 

i^O 

since the (n-A-2) bits indicated must contain (k-2) errors. Thus by (137) 

Q,(k,n) = U(G)Q(k,n) . (lliO) 

Combining this with (117) and (136) we obtain 

P(k errors in a given burst of length n) = 

= 2^121 ; nil. 

L(n) 


Here also let us use the method of moment generating functions to find the 
mean number of errors in a burst of given length. Denote this mean by n 

is the burst length. So 

n _ 

S ko(k,n) 

ic = — ; n s k 2: 2 (l4l) 

L(n) 

We know that 

= 1 

Kg = 2 . 

By definition of Q(k,n) we can write 
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Q(k,n) = s n-£-l) 


(1-iS) 


£=0 


where V (^) is as defined in (\2P ). Note that (lUp) holds for all values 
G 

of k and n except for k = n = 1. So if we denote 


<50 CO 


Q.(yjx) £ S S Q(k,n)y^ x” 
k=l n=l 


E E Q(k,n)y^ x” + yx 
k^ n2:2 


(1^3) 


Q(y,x) is well defined since Q(k,n) are probabilities. We can write, for 
all k and n 

Q(y,x) - yx *= xy S E V (jt)x^ E Q(k-1, n--e-l)x” ^ 

k=2 Ib^O ^ n=2 

« xyV-(x) S E Q(k-1, n-;e-l)x"“-^'V"^ 

^ k=2 n-tAZ 


xyVg(x)Q(y, x) 


(144) 


or 


- i-xX(xy 




Denote the partial derivative of Q(y,x) w.r.t. y by Q^(y,x). Then 


ey(y,x) . — 


(l-xyVg(x)) 
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so 


Qy(l,x) = I R^(x) 


(:lH6) 




by (12^0 


By (1^3), for a fixed n 


S kQ(k,n) is equal to the coefficient of the term of Q (l,x) 


k^l 


i.e. the coefficient of the term of — 

X 


E L(n)x^ 
n=l 


which is equal to 


n 

E L(k)L(n-k+l) ( 14 ?) 

k=l 


Therefore by ( l4l) 


n _ _ 

E L(k)L(n-k+l) 

if = i 2 i_ ( 148 ) 

L(n) 


The variance of 


L(n) 


is given by the coefficient of the 


th 


term of 


where 


0 (Ijx) 

_yy 

L(n) 


K - 

n 


n 


(149) 




is the second partial derivative of Q(y,x) 
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w.r.t. y i.e. 


and 


Qyy(y,X) = 


2X^Vg(x) 

(l-xyV^(x)) 


3 


0 

yy ’ 


2X^Vq(x) 

(i-xv^(x))3 


= 2V^(x) ^ R^(x) 


(c) Let d = block guard space and L^(n) = probability of a block burst of 
length n (each block is s bits long). 

L®(n) = P®[0 Ho H ••• 10 HoHjoH} 

r,t ^ d ; 0 ^ i. d-1; 23 ji. £ n-2 

J 0 


Thus 



..•10 H[i}p®(oH|i) 


with 


E p®(oH|i) 

tsd 


E 

i 


sd 

HPi 


(i-P®) 

p3TI^ 


= lf{d) . 


(ipO) 


(151) 


(152) 
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Put 


L (n) = P {0 \ ••• 10 ’^lll] 


min(d-l,n-2) 

S 


V®(je)L®(n-£-l) 


so that L^(l) 


“ 1; where 


= p®(o^i|i ) 


■7^ ' ^1' 

2-/ c , p . / - \ 

i 1 p. (i-p. ) 


(i-p, ) 


1 


Sc. 

1 


Pi(l-Pj^l 


Then we can state L (n) in the following 


Proposition 3s 


(li'3) 


L"(n) 


0 for n ^ 0 
if (d)L®(n) ; n a 1 
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